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TITLE OF THE INVENTION 

PYROPHOSPHOROLYSIS ACTIVATED POLYMERIZATION (PAP) 
BACKGROUND OF THE INVENTION 

[OOOIJ This invention relates lo nucleic acid polymerization and Amplification. In particular, it 
relates to a novel and general method for nucleic acid amplification, in which 
pyrophofiphoiolysis and polymerization arc aerially-coupled. The method has been adapted for 
allele-specific amplification and can greatly increase ihc specificity to detect an extremely rare 
allele in the presence of wild-type alleles. We refer to the method as pyrophoephorolysis 
activated polymerization (TAP). 

[0002] The publications and other materials ysed herein to illuminate the background of the 
invention or provids additional details respecting the practice, are incorporated by reference, and 
for convenience arc respectively grouped in the appended Bibliography. 

[0003] Multiple methods for dtnecting mutations present in less than 10% of cells (i.e. rare 
alleles) have been developed, including PCR amplification of specific alleles (PAS A), peptide 
nucleic acid (PNA) clamping blocker pClc. allelevspecific competitive blocker PGR, mismatch 
amplification mutation assay (MAMA), restriction i&agment-length polymorphism (RFLP)/PCR 
{Parsons suid Heftich, 1997) and QE-PCR (Rerun and Minamata, 1997). These methods: i) 
amplify the n«e allele selectively, ii) destroy the abundant wild-type allele, or in) spatially 
separate the rare allele from the wild-type allele. The specificity achievable under typical 
research/clinical conditions is 10"' (Parsons and Heflich, 1997), although a few publications 
reported higher specificity of detection (Pourzand and Cerutti, 1993; Knoll et al., 1996). These 
methods either do not generally achieve the higher specificity or are not suitable tor routine 
analysis. 

|0004f A robust method of detecting one mutant allele in iC-lO* wild-type alleles would be 
advantageous for many applications including detecting minimal residual disease (recurrence 
after remission or rare remaining cancer cells in lymph nodes and other neighboring tissues) and 
measurement of mutation load (the frequency and pattern of somatic mutations present in 
normal tissues). Individuals with a high mutation load may be at increased risk for cancer due to 
either environmental exposure or endogenous defects in any of hundreds of genes necessary lo 
maintain the integrity of the genome, For those individuals found to have a high mutation load, 
clues to etiology can be obtained by fofining Oie mutation pattern. 

[0005] There are many DNA sequencing meOiods and their variants, such as the Sangor 
sequencing using dideoxy tenuination and denaturing gel electrophoresis (Sanger et al., 1977), 
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Maxam- Gilbert sequencing using chemical cleavage and denaturing gel electrophoresis (Max am 
and Gilbert, 1977), pyre -sequencing detecting pyrophosphate (FPj) released during the DNA 
polymerase reaction (Ronaghi et a]„ 1998), and sequencing by hybridization (SBH) using 
oligonucleotides (Lysov et al„ 19S8; Bains and Smith, 1988; Drmanac et al., 19S9; Khrapko et 
al., 19S9; Pevsner et al., 1989; Southern et al., 1992). 

[0006] There are multiple gel-based methods for scanning for unknown mutations including 
single stranded conformation polymorphism (SSCP) and the SSCP-hybrid methods of dideoxy 
fingerprinting (ddF), restriction endonucleage fingerprinting (REF), and Detection Of Virtually 
All Mutations-SSCP {DOV AM-S), denaturing gradient gel electrophoresis (DGGE), denaturing 
HPLC (dHPLC) chemical or enzymatic cleavage (Sarkar et al., 1992; Lin and Sommer, 
Liu ct al., J99P; Mycra c| al„ 1985; Cotton et al., 1988; Liu et al., 1999; Burin et al., 2GO0; 
Spiegclman et aL, 2000), DOVAM-S and chemical cleavage reactions have been shown in 
blinded analyses to identify essentially all mutations (Buzin et al., 2000). dHPLC, which is 
based on reverse phase chromatography, also may identify essentially all mutations under 
appropriate conditione (O'Donovan et al., 1998; Qefher and Underhill^ 199S; Spiegelman el al., 
2000). Efforts are under way lo develop general scanning methods with higher throughput. 
[0007J Sequencing by hybridization (SBH) is being adapted to scanning or resequencing for 
unknown mutations on micro arrays (Southern, 1996). This continues to be a promising area of 
intense study. However it is not possible as yet to detect most microinsertions and deletions with 
this approach and the signal to noise ratio for single base changes precludes detection of 500% 
of single nucleotide changes (Hatia, 1999), Alternative approaches warrant exploration. 
[0008] ft is becoming increasingly apparent that in vivo chromatin structure is crucial for 
mammalian gene regulation and development. Stable changes in chromatin structure often 
involve changes in melhylation and/or changes in histone acetylation. Somatically heritable 
changes in chromatin structure are commonly called cpigenctic changes (Russo and Riggs, 
1996) and it is now clear that epigenetic "mistakes" or epimutations are frequently an important 
contributing factor lo the development Df cancer (Jones and Laird, 1999), 

[0009] One of the few methods for assaying in vivo chromatin structure, and the only method 
with resolution at the single nucleotide level, is tigation-mediated PCR (LM-PCR) (Mueller and 
Wold, 19S9; Pfeifer et al., 1989) and its variant of terminal transferase-mediated PCR(TD-PCR) 
(Komura and Riggs, 1998). Many aspects of chromatin structure can be determined by LM- 
PCR, such as the location of methylated cytosine residues, hound transcription factors, or 
positioned nucleo somes. It is readily apparent that LM-PCR works better with some primer sets 
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than with others. Thus, it is desired to develop a more robust method of measuring chromatin 
structure. 

[0010] Thus, it is an object of the present invention to develop alternative methods for 
amplification of DNA, for sequencing DNA and for analysis of chromatin structure. This object 
s is accomplished by the use of the novel pyrophosphorolysis activated polymerization (PAP) as 
described herein. PAP has the potential to enhance dramatically the specificity of the 
amplification of specific alleles, for resequencing DNA and for chromatin structure analysis, 

SUMMARY OF THE INVENTION 
it) [00 11 } The invention is a pyrophosphorolysis activated polymerization (PAP) method of 
synthesizing a desired nucleic acid strand on a nucleic acid template strand. The method 
comprises the following steps carried out serially. 

[0012] (a) Annealing to tfje template strand a complementary activatable oligonucleotide P*. 
This activatable oligonucleotide has a non -extendible 3 T terminus that is activatable by 

is pyrophosphorolysis (hereinafter referred to as a non-extendible 3' terminus or a 3' non- 
extendible end or a non-extendible 3* end). The non-extendible 3" temunus {or end) is a 
nucleotide or nucleotide analog which has the capacity to form a Watson-Crick base hair with a 
complementary nucleotide and which lacks a 3' OH capable of being extended by a nucleic acid 
polymerase. In one embodiment, the non-extendible 3' terminus may be a non-extendible 

2 o 3'deoxynucleotide, such as a dideoxynucleotide. In a second embodiment, the non-estendible 3' 
terminus may be a chemically modified nucleotide lacking the 3' hydro xyl group, such as an 
acyclonucleotide, Aeyclonucleotides substitute a 2-hydroxyethoxymethyl group for the 2'- 
deoxyribofuranosyl sugar normally present in dNMPs. In other embodiments, the non- 
extendible 3* terminus may be other blockers as described herein. In one embodiment, the 

25 activatable oligonucleotide P* has no nucleotides at or near ils 3" terminus that mismatch the 
corresponding nucleotides on the template strand. In a second embodiment, the activatable 
oligonucleotide P* has a mismatch at or within 16 nucleotides of its 3' terminus with respect to a 
corresponding nucleotide on the template strand. The terminal 3''deoxynucleotide is hybridized 
to the template strand when the oligonucleotide P* is annealed. 

30 [0013] (b) Pyrophosphorolyzing the annealed activatable otigonucleotide P* with 
pyrophosphate and an enzyme that has pyrophosphorolysis activity. This activates the 
oligonucleotide F* by removal of the hybridized non-extendible 3' terminus. 
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[0014] (c> Polymerizing by extending the activated oligonucleotide P* on the template strand in 
jprescnuc of four nucleoside triphosphates of their analogs and a nucleic acid polymerase to 
synthesize the desired nucleic acid strand. 

[0015] The PAP method can be applied to amplify a desired nucleic acid strand by the following 
additional steps. 

[001 6] (d) Separating the desired nucleic acid strand of step (c) from the template strand, and 
[0017| (e) Repeating steps (a)-(d> until a desired level of amplification of the desired nucleic 
acid a hand is achieved. 

[0013] In a preferred aspect, the PAP method as described above is applied to allele-specific 
amplification (PAP- A). In this application, the nucleic acid template strand is a sense or 
antisense strand of one allele and is present in admixture with the corresponding (sense or 
antisense) nucleic acid strand of the second allele {the allelelic strand). The activatable 
oligonucleotide P+ has at least one nucleotide or analog at or near its 3* tenninus, e.g., within 16 
nucleotides of the 3* tenninus, that mismatches the corresponding nucleotide of the allelic strand. 
Because of the mismatch, in step (a) of the PAP method the uon-extendible 3' terminus of 
oligonucleotide P* is not substantially hybridized to the allelelic strand. In step (b) the 
pyrophosphorolysis does not substantially remove the non-hybridized non-extendible 3' 
terminus from the activatable oligonucleotide P* annealed to the allelic strand. In step (c) the 
oligonucleotide P* is not substantially extended by polymerization on the allelic strand. As a 
result, the desired nucleic acid strand synthesized on the template strand is amplified 
preferentially over any nucleic acid strand synthesized on the allelelic strand. 
[0019] In a second preferred aspect, the PAP-A method described above can be performed 
bidirectional ly (Bj*PAP-A). Bidirectional-PAP (Bi-PAP) is a novel design that preferably uses 
two opposing pyrophosphorolysis activatable oligonucleotides (P*) with one nucleotide overlap 
at their 3' termini. Thus, in Bi-PAP, PAP-A is performed with a pair of opposing activatable 
oligonucleotide P*s. Both Ihe downstream and upstream P*s are specific for the nucleotide of 
interest at the 3' termini (eg., an A:T base pair). In the initial round of amplification from 
genomic DNA, segments of undefined size are generated. In subsequent rounds, a segment 
equal to the combined lengths of the oligonucleotides minus one is amplified exponentially. 
Nonspecific amplification occurs at lower frequencies because this design eliminates 
misincorporation error from an unblocked upstream. The P*s may be 30-60 nucleotides for 
most efficient amplification. 



[0020] The PAP method can be used to amplify either UNA or DNA. When used to amplify 
DNA, the activatable oligonucleotide P* may be a 2 , -deoxyoligonucleotide, the nou-extendible 
3' terminus may be, e.g. r a 2 f ,3 , -dideoxynucleotide or an acyclonucleotide or other blockers as 
described herein, the four nucleoside triphosphates are 2'-deoxynucieoside triphosphates or their 
s analogs, and Uie nucleic acid polymerase is a DNA polymerase. The DNA polymerase used in 
step (c) can also be the enzyme having pyrophosphorDlysis activity used in Step (b). Preferred 
DNA polymerases having pyrophosphorolysis activity are thermostable Tfl t Taq, and genetically 
engineered DNA polymerases, such as AxnpliTaqVs and ThermoSequenase™. These 
genetically engineered DNA polymerases have the mutation F667Y or an equivalent mutation in 
10 their active sites. The use of genetically engineered DNA polymerases, such as AxnpMTaqfs 
and ThermoSequenase TW t greatly improves the efficiency of PAP. These Family I DNA 
polymerases can be used when the activatable oligonucleotide P* is a 3' dideoxynucleotide or an 
acyclonucleotide. When the activatable oligonucleotide P* is an acyclonucleotide, Family IT 
archaeon DNA polymerases Can also be used. Examples of such polymerases include, but are 
is not limited to, Vent (exo-) and Pfu (exo-), These polymerases efficiently amplify 
3 'acyclonucleotide blocked P*. Two or more polymerases can also be used in one reaction. If 
the template is RNA, the nucleic acid polymerase may be RNA polymerase, reverse 
transcriptase, or their variants. The activatable ougonucleotide P* may be a ribonucleotide or a 
Z'-deoxynucleotide. The non-extendible 3' terminus may be a 3" deoxyribonucleotide ot an 
2 0 acyclonucleotide. The four nucleoside triphosphates may be ribonucleoside triphosphates, 2' 
deoxynucleoside triphosphates or their analogs. For convenience, the description that follows 
uses DNA as the template. However, JRNA is also included, such as described for the present 
aspect. 

[0021] Amplification by the PAP method can he linear or exponential Linear amplification is 
2S obtained when the activatable oligonucleotide P* is the only complementary oligonucleotide 
used. Exponentiai amplification is obtained when a second opposing oligonucleotide, which may 
be a P* is present that is complementary to the desired nucleic acid strand. The activatable 
oligonucleotide P* and the second oligonucleotide flank the region that is targeted for 
amplification. In step (a) the second oligonucleotide anneals to the separated desired nucleic 
30 acid strand product of step (d). In step (c) polymerization extends the second oligonucleotide on 
the desired nucleic acid strand to synthesize a copy of the nucleic acid temp&te strand. In step 
(d) the synthesized nucleic acid template strand is separated from the desired nucleic acid strand. 
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Steps (a) through (d) are repeated until the desired level exponential amplification has been 
achieved. 

[0022] In the PAP method, a mismatch between the activatable oligonucleotide P* and the 
template strand results in no substantial amplification, if the mismatch occurs in the 3 1 specific 
subsequence of P+ at the 3' terminus of P* or within lo" nucleotides of the 3' terminus of P+. 
Tins lack of amplification for such mismatches in the 3' specific subsequence of P* provides 
• four billion different and specific oligonucleotides with one base substitution resolution. 
[0023J In a preferred aspect, the PAP method is used for exponential amplification of a rare, 
mutant allele in a mixture containing one or more wild-type alleles. Strands of the alleles are 
separated to provide single-stranded nucleic acid, and then the following steps are curried out 
serially. 

[0024] (a) Annealing lo the sense or antisense strands of each allele a complementary 
activatable 2'-deoxyotigoouc]pdtide P* that has a non-extendible 3' terminus. The non- 
extendible 3' terminus may be, e.g., a non-extendible 2',3'-dideoxynucleotide or an 
acyclonucleotide. P* lias no 2-deoxynucleotides at or near its 3" terrninus that mismatch the 
corresponding 2 , -deoxynucleoride$ on the mutant strand, but has at least one 2'.deoxynucleotide 
at or near its 3' terminus that mismatches the corresponding 2'-deoxynucleotide on the wild-type 
stand. Consequently, the non-extendible 3' terrninus is hybridized to the mutant strand but not to 
the wild-type strand when the oligonucleotide P* is annealed. Simultaneously, a second 
2'ndeoxyoligojauclecitide that is complementary to the anti-parallel strands of each allele is 
annealed to the anti-parallel strands. The activatable 2'^deoxyoligonueleoiide P* and the second 
2'-deoxyoligonucIeotide flank the region of the gene to be amplified. 

[0025] (b) Pyrophosphorolyzing the activatable 2Meoxyoligonuc]eoride P* that is annealed to a 
mutant strand with pyrophosphate and an enzyme that has pyrophosphorolysis activity. This 
activates the 2'-deoxyoligomicleotide P* that is annealed to the mutant strand by removal of the 
hybridized non-extendible 3' tenninus. It does not substantially activate the 
2'-deoxyoligonucleotide P* when it is annealed to the mutant strand because the non-hybridized 
non-extendible 3' terminus is not substantially removed by the pyrophosphorolysis. 
[0026} (c) Polymerizing by extending the activated oligonucleotide P* on the mutant strand in 
presence of four nucleoside triphosphates or their analogs and a DNA polymerase and extending 
the second 2.'-deoxyoligonucleotide on both mutant and wild-type anti-parallel strands. 
[0027] (d) Separating the extension products of step (c); 
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[0028] (e) Repeating steps <a>(d) until the desired level of amplification of the mutant allele has 
been achieved, 

[0029] The activatable Z'-deoxyoligonucleoh'de P+ is annealed to the antisense strands of the 

alleles and the second 2'-deoxyoligonucleotide is annealed to the sense strands, or vice versa, 
s [0030] Steps (a) to (c) of PAP can be conducted sequentially as two or more temperature stages 

on a rhermocycler, or they can be conducted as one temperature stags on a rhermocycler, 

[0031] Nucleoside triphosphates and ^-deoxynucleoside triphosphates or their chemically 
■modified versions may be used as substrates for multiple-nucieotide extension by PAP, i.e., 

when one nucleotide is incorporated tbe extending strand can be further extended. 
10 2' 1 3'-dideoxynucleo5ide triphosphates, their chemically modified versions, acyclonucleotides or 

other blocked nucleotides which are terminators for further extension may be used for singic- 
nucleotide extension. 2\3'-dideo)(ynucIeGsidc triphosphates may be labeled with radioactivity or 

dye for differentiation from the 3' terminal dideoxynucleotide, if present, of oligonucleotide p*. 
Mixtures of nucleoside triphosphates or Z-deoxynucleotide triphosphates or their analogs, and 
1 5 i'.S'-didcoxynucleoside triphosphates or their analogs may also be used. 

[0032] PAP can be used in a novel method of DNA sequence determination. In PAP, 
pyrophosphorolysis and polymerization by DNA polymerase are coupled serially by using P\ 
an oligonucleotide containing a non-extendible 3 T terminus. The non-extendible 3' terminus may 
be, e.g., a non-extendible 3*-deoxynucleotide or an acyclonucleotide. This principle is based on 
so the specificity of PAP and in turn on the base pairing specificity of the 3' specific subsequence. 
This property of the 3' specific subsequence can be applied to scan or resequence for unknown 
sequence variant*, to determine tfe novo DNA sequence, to compare two DNA sequences, and to 
monitor gene expression profiling in large scale, A P* array is possible in these methods. That 
is, each of the P*s can be hnmcbihzed at an individual dot or a solid support, thus allowing all 
2 s the PAP reactions to be processed in parallel. 

[0033] Thus in one aspect, the PAP method is used for scanning or resequencing unknown 
sequence variants within a predetermined sequence by carrying out the following steps serially. 
[0034] (a) Mixing under hybridisation conditions a template strand of the nucleic acid with 
multiple sets of four activatable oligonucleotides P* which are sufficiently complementary to the 
30 template strand to hybridize therewith. Within each set uie oligonucleotides P* differ, from 
each otaer in having a different non-extendible 3' terminus, so that the non-extendible 3' 
terminus i& hybridized to the template strand if the template strand is complementary to the non- 
extendible 3" terminus. The number of Bets corresponds to the number of nucleotides in the 
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sequence. The non-extendible 3" terminus may be, e.g., a non-extendible 3 '-deoxynucleotide or 
, ah acyclonucleotide. 

(0035] (b) Treating the resulting duplexed P"s with pyrophosphate and an enzyme that has 
pyrophosphorolysis activity to activate by pyrophosphoTO lysis only those oligonucleotides P* 
which have a non- extendible 3' terminus that is hybridized to the template strand. 
[0036] (c) Polymerizing by extending the activated oligonucleotides P* on the template strand 
in presence of four nucleoside triphosphates or their analogs and a nucleic acid polymerase. 
[0037] (d) Separating the nucleic acid strands synthesized in step (c) from the template strand. 
J0038] (e) Repeating steps {&)-{d) "nttf a desired level of amplification is achieved, and 
[Q039J (f> Arranging the nucleic acid sequence in order by analyzing overlaps of oligonuclotides 
P* that produced amplifications. 

[0040] In a second aspect, the PAP method is used for determining de novo the sequence of a 
nucleic acid by carrying out the following steps serially. 

[0041] (a) Mixing under hybridisation conditions a template strand of the nucleic acid with 
multiple activatable oligonucleotides P 4 . Ail of the oligonucleotides P* have the same number n 
of nucleotides as the template and constitute collectively all possible sequences having n 
nucleotides. All of the oligonucleotides P* have a non-eatendible 3' terminus. The non- 
extendible 3' terminus may be, e.g., a non- extendible 3 '-deoxynucleotide or an acyclonucleotide. 
Any oligonucleotides P* that are sufficiently complementary will hybridize to the template 
Strand. The non-extendible 3* terminus will hybridize to the template strand only if the template 
strand is complementary at the position corresponding to the 3* terminus. 

£0042] (b) Treating the resulting duplexed p*s with pyrophosphate and an enzyme that has 
pyrophosphorolysis activity to activate only those hybridized oligonucleotides P* which have a 
non-exf&ndible 3 1 terminus that is hybridized to the template strand, by pyrophoaphoro lysis of 
those hybridized non-extendible 3' tennini. 

[0043] (c) Polymerizing by extending the activated oligonucleotides P* on the template strand 
in presence of four nucleoside triphosphates or their analogs and a nucleic acid polymerase. 
[0044] (d) Separating the nucleic acid strands synthesized in step (c) from the template strand. 
[0O45] (e) Repeating steps (a)-(d) until a desired level of amplification has been achieved, and 
[0046] (f) Determining the sequence of oligonucleotides P* that produced amplifications, then 
arranging the nucleic acid sequence in order by analyzing overlaps of these oligonucleotides. 
[0047] PAP can also be used to study chromatin structure analogously to ligation-mediated PCR 
(LM-PCR) by carrying out the following steps serially. LM-PAP has been used for the 
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determination of primary nucleotide sequence, cytosine methylation patterns, DNA lesion 
formation and repair and in vivo protein-DNA footprints (paf et al, v 2000; Mueller and Wold, 
1989; Pfeifer et ah, 1989; Pfeifer et si., 1999i Becker and Grossman, 1993). Ligati on-mediated 
PAP (LM-PAP) involves cleavage, primer extension, linker ligation and PAP that can be applied 
5 for analysis of in vivo crnxmatin structure, such as, methylated state of chromosomes, and for 
othernuclcic acid analysis as for LM-PCR. 

[0048] The nature of LM-PAP is that the template is synthesized before PAP, such as by 
ligation reaction or by extension using terminal transferase. PAP may be any type of PAP: with 
only one P*, with two opposing oligonucleotides where at least one is P*, Bi-PAP, matched 
10 PAP, mismatched PAP, and so otL Thua, at its simplest, LM-PAP is the application of PAP lo a 
preaynthesized template. LM-PAP may be performed by steps (i), (ii), (in), (iv) and (v> T by 
steps (i), (ii), (tit) and (vi), by steps (ii), (iii), (iv) and (v) or by steps (it), (Hi) and (vi), where the 
steps are as follows. 

[0049] (i) The cleavage occurs chemically, enzymatically or naturally to "breakdown" nucleic 
15 acid strands. The nucleic acid usually is genomic DNA mat may have lesions ot nicks produced 
in vivo. 

[0050] (ii) The oligonucleotide PJ is gene-specific and its extension includes: 1) annealing to 
the template strand a substantially complementary oligonucleotide; 2} extending the 
oligonucleotide on the template strand in the presence of nucleoside triphosphates or their 
20 analogs and a nucleic acid polymerase, the extension "runs off" St the cleavage site on the 
template Strand. Steps I) and 2) may be repeated. 

[005 1] The primer extension may be replaced by a P* extension, in which the above PAP is 
performed with only one activatable oligonucleotide P*. 

[O052J (in) The linker ligation step includes ligation of a linker to the 3' terminus of the 
25 synthesized nucleic acid strand. The linker ligation step may be replaced by a terminal 
transferase extension that is non-template dependent polymerization and an extra nucleic acid 
sequence is added to the 3 T terminus of the synthesized nucleic acid strand. 
[0053J (iv) PCR is performed with a second gene-specific oligonucleotide (F2) together with an 
oligonucleotide specific for the linker or the added sequence by terminal transferase, 
lo [00541 (v) A third gene-specific P* (P3) is used to detect the PGR generated fragments. PAP 
method is applied with only one activatable oligonucleotide P*. The extension of the activated 
oligonucleotide P* "runs off 7 at the end of the template strand generated in IV. The PAP 
method may be applied in an allele-specific manner. The activatable oligonucleotide P* may 
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contain one or more nucleotides that are not complementary to the template strand. The 
uncomplimentary nucleotides) of P* may locate at the 3' terminus of P*. 
[0055] (vi) Instead of steps (iv) and (v), PA? method can he applied with two opposing 
oligonucleotides of which at least one is the activatable oligonucleotide P*. The activatable 
oligonucleotide P*(P3) is gene-specific. The second oligonucleotide is specific for the linker or 
the added sequence by terminal trans feraae. The second oligonucleotide may be another 
'*< activatable oligonucleotide P* or a regular oligonucleotide. The PAP method may he applied in 
an allele-specific manner. The activatable oligonucleotide P* (P3) may contain one or more 
nucleotides that are not complementary to the template strand. The uncomplimentary 
nucleotide^) of P* may locate at the 3" terminus of P* (P3). 

[0056") The third gene-epecjfie oligonucleotide (P3) is then usually Used to label and allow 
visualization of the PCR generated fragments. P3 is labeled at the 5' terminus with 31 P or, more 
recently, with near infrared fluoroehromes such as IXD 700 or IRD 800 (Li- Cor Inc.) (Dai et al., 
2000). 

[00S7J PAP can be used to detect a target nucleic acid. In one embodiment this method involves 
the following steps: 

[0058] (a) adding to a nucleic acid containing sample an oligonucleotide P*, wherein the 
oligonucleotide P* has a non-extendible 3' terminus, wherein the 3' terminal residue of 
oligonucleotide P* is removable by pyrophosphorolysis, and wherein the oligonucleotide P* 
anneals to a substantially complementary strand of tbe target nucleic add present in the sample; 
[0059] (b) removing the 3* non-extendible terminus of the oligonucleotide P+ annealed to the 
substantially complemcutaiy strand of the target nucleic acid by pyrophosphorolysis to unblock 
the oligonucleotide P* to produce an unblocked oligonucleotide; and 

[006O] (c) detecting the presence of the target nucleic acid, wherein the sequence of the target 
nucleic acid is substantially complementary to the sequence of the oligonucleotide p*. 
[00*1] The method of the first embodiment may further include before die detection step the 
step: {hi) extending the unblocked oligonucleotide using a nucleic acid polymerase to produce 
an extended oligonucleotide. The method may also include the addition of a second 
oligonucleotide which may or may not have a 3" non-extendible terminus. The second 
oligonucleotide may anneal to the substantially complementary strand of the target nucleic acid 
or it may anneal to the complement of tho substantially compIemenTarty strand of the tercel 
nucleci acid. 
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[0062] In a second embodiment for detecting a nucleic acid the method involves the following 
steps: 

[0063] (a) adding to a nucleic acid containing sample two oligonucleotide P*s, wherein each 
oligonucleotide P* has a non-extendable 3' terminus, wherein the 3' terminal residue of each 
& oligonucleotide P* is removable by pyrophosphorolysis, wherein one oligonucleotide P* 
overlaps with the other oligonucleotide P* by at least one nucleotide at their respective 3* ends, 
and wherein one oligonucleotide P* anneals to a substantially complementary strand of the 
target nucleic acid present in the sample and the other oligonucleotide p* anneals to a 
complement of the substantially complementary strand of the target nucleic acid; 
id [00641 (b) removing the 3' non-extendable terminus of the oligonucleotide P*s annealed to the 
target nucleic acid by pyrophosphorolysis to unblock the oligonucleotide P*s to produce 
unblocked oligonucleotides; and 

r0065] (c) detecting the presence of the target nucleic acid, therein the sequence of the target 
nucleic acid is substantially complementary to the sequence of the oligonucleotide P*s. 
is [0066] The method of the second embodiment may further include before the detection step the 
step; (bl) extending the unblocked oligonucleotide using a nucleic acid polymerase to produce 
an extended oligonucleotide. 

[006TJ In one embodiment, the detection of the nucleic acid in step (c) is performed by detecting 
the unblocking of oligonucleotide P*. In one aspect the unblocking is detected by loss of a 

20 label contained in the V terminal residue of oligonucleotide P*« In a second aspect, the 
unblocking is detected by detecting the presence of a 3' OH on the 3 f terminal residue that is 
capable of exten«ion or ligation. In this aspect, the detection is determined by extending the 
unblocked oligonucleotide or by ligating the unblocked oligonucleotide to an oligonucleotide* 
In a second embodiment, the detection of the nucleic acid in step (c) is performed by detecting 

as the extended oligonucleotide. In one aspect, the extended oligonucleotide is detected by the 
presence of a label in the extended oligonucleotide, The label is part of a nucleotide or 
nucleotide analog used in the extension step. In a second aspect, the extended oligonucleotide is 
detected by gel electrophoresis- In a third aspect, tbe extended oligonucleotide is detected by ihe 
binding or incorporation of a dye or spectra] material. 

30 [00681 The P* oligonucleotides are selected to be "substantially 11 complementary" to the 
different strands of each specific sequence to be amplified. Therefore, The P* oligonucleotide 
sequence need not reflect the exact sequence of the template. For example, a non- 
complementary nucleotide segment may be attached to the 5'-end of the P* oligonucleotide, with 
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the remainder of the p+ oligonucleotide sequence being complementary to the strand. 
Alternatively, non-complementary oases or longer sequences can be interspersed into the P* 
"oligonucleotide, provided that the P* oligonucleotide sequence has sufficient complementarity 
with the sequence of the strand to be amplified to hybridize therewith and form a template for 
s synthesis of the extension product of the other ¥* oligonucleotide. The ability to detect 
nucleic acid sequences which are substantially complementary to oligonucleotide P+ is 
particularly useful for the detection of multiple mutations, such as seen in high viral load, where 
the detection of the presence of the virus is important and not necessarily the exact nucleic acid 
sequence of the virus. This method is aieo capable of detecting nucleic acids that are completely 
1 o complementary. 

[0969] 'Hie present invention also includes other modifications of PAP. 

(00701 ' Th E activatablt oligonucleotide P* may contain blocked nucleotides at other positions 
in addition to the 3' terminus. 

[0O7IJ • The introduction of internal blocking nucleotides results in an interface between 
is amplification and PAP which would permit PAP to amplify in a non- exponential manner (e.g.> 
quadratic or geometric) with higher fidelity, i.e., errors made by the polymerase would not be 
prop abatable. 

[0072] ■ The activatable oligonucleotide P* may contain modified nucleotides that are 
extendible as well as the 3' blocked nucleotide. Thus, anywhere 5' to the 3' terminus, there may 
20 be either blocking or non-bfocking modified nucleotides. 

[0073J * A polymerase that pyrophosphorolyzes the mismatched primer rather than the matched 
primer could be used to detect rare mutation b in which the P* that mismatched at the 3' terminus 
is activated and extended. 

[0074] * The detection of a rare mutation is based on no mismatch anywhere along the length of 

{_ the oligonucleotide because a mismatch inhibits the activation of P"s. 

[0075] • Activation may occur by another mechanism, such as a 3" exonuelease. The 3" 
exonucleaae may have specificity for the matched primer or the mismatched primer so that it 
discriminates as to whether there is a mismatch at the 3 + end. The V exonuolease can be used 
either way. If it prefers a mismatch, it can be used as described above, but its ability to detect 

jo uncommon mutations would depend ou some specificity for activation, although that specificity 
may come partly from internal mismatches. 
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[0076] • The extension reaction can be performed by a DMA polymerase, an RNA polymerase 
or a reverse transcriptase, the template may be a DNA or an RNA, and the oligonucleotide P* 
may be a DNA, an RNA f or a DNA/RNA heteromer. 

[0077] • Pyrophosphorolysis and the extension can be performed by different polymerases. For 
s example, the P+ may include a penultimate modified oligonucleotide that could not be extended 
by pyrophosphorolyabg polymerase but could be extended by another polymerase. One 
example is a 3' dideoxy that could be pyrophosphorolyzed by a DNA polymerase, but the 
presence of a ribonucleotide in the penultimate position would require extension by an RNA 
polymerase. 

10 [0078] * PAP can be generalized as an inactive oligonucleotide that is activated by a nucleic 
acid metabolizing enzyme, such as helicases, fopoisomerases, telomerases, RNAH or restriction 
enzymes, 

[0079] ■ Methylases would detect the presence or absence of a methyl group in genomic DNA. 
Methylases could be coupled with truncating amplification which forces the polymerase hack to 
is the template. 

[0080] « A P* in which the 3' end is a dideoxy and penultimate few nucleotides arc ribos can be 
used as a tool for differentially making a protein product derived from a specific mutation that 
was desired, or for making a protein product whose expression is linked to the presence of a 
particular sequence. Pyiophosphorolysis would activate the P* if there was a precise match to 
so the mutation at the V end. The activated oligonucleotide is then a substrate for the generation of 
RNA by an RNA polymerase. The RNA could then be translated in vitro to produce the protein 
product. 

{0081] - PAP (PAP, Bi-FAP, matched or mismatched PAP, simplex PAP, multiplex PAP and 
others) can be used for quantification. The yield of the amplification products is quantitatively 
as associated with the amount of input template. The association may be proportional or othetwise. 
[0082] • In PAP, product may accumulate linearly, exponentially or otherwise. 

BRIEF DESCRIPTION OF THE FIGURES 

[0083] Figure 1 shows a schematic of the detection of a rare mutation by allele-specific PAP 
3 0 (PAP-A). 

[0084] Figure 2 shows a schematic of bidirectional PAP-A (Bi-PAP-A). 

[0085] Figure 3 shows a schematic of PAP-based resequencing (PAP-R) performed on a 
microarray with programmable photochemical oligonucleotide. 
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[00B6] Figure 4 shows a schematic Df microarray-based resequencing to detect a G to A 
mutation, 

[008T] Figure 5 shows a schematic of ligation-mediated PGR (LM-PCR). 
[0088] Figures 6 A and 6B arc a schematic illustrating use of PAP to detect the G allele at 
5 nucleotide 229 of the Dj dopamine receptor gene. The procedure is described in detail in 
Example t below. 

j [0089] Figure 6C is an autoradiogram of PAP from the G/G, A/A and G/A genotypes of the 
human dopamine receptor gene. 

[00901 Figures 7A and 7B arc diagrams illustrating enhanced specificity of PAP relative to 
10 PASA. 

[009 J] Figures 8 A and SB are autoradiograms showing the results of electrophoresis of samples 
obtained in Example I below. 

[00921 Figure 9 is an autoradiogram showing the results of electrophoresis of samples obtained 
in Example 1 below. 

i s f 0093 1 Fi gure 1 0 i s an autoradiogtam sho wing the results of el ectrophoresis of samples obtained 
in Example 1 below. 

[0094] Figure 11 A is a schematic illustrating enhancement of PAP efficiency. 
[0095] Figure 1 IB is an autoradiogram of PAP from the G/G, A/A and G/A genotypes of the 
human dopamine receptor gene. 
30 [O096] Figures 12A-12E arc autoradiograms showing the reBults of electrophoresis of samples 
obtained in Example 2 below. 

[0097] Figure 13 is an autoradiogram showing the results of electrophoresis of samples obtained 
in Example 2 below. 

[0098] Figure 14 is an autoradiogram showing the results of electrophoresis of samples obtained 
as in Example 2 below. 

[0099] Figure 15 is an autoradiogram showing the results of electrophoresis of samples obtained 
in Example 3 below. 

[0100] Figures 16A-16B show UV footprmting by LM-PAP. Fig. 16A shows allele-specific 
LM-PAP ver&us alltle-specific LM-PCR for the dopamine D T receptor gene. Fig. lfiB shows 
3 1> LM-PAP for the pgk gene . 

[01011 Figures 17A-I7B show PAP amplification directly from human (Fig. 17A) and mouse 
(Fig. 1 7B) genomic DNA using PAP and Bi-PAP. respectively. 
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[GT02] Figures 18A-18E show PAP amplification using 3' terminal acyclonucleotide blocked 
P*. Fig- 18 A: Model: A duplexed DNA template of the tad gene is shown. The mutated 
template contains a G at the nucleotide position 369, while the wild-type template contains a T 
at the nucleotide position 369 of the lad gene. P* - pyraphOHphorolysis acrtvatable 
s oligonucleotide. The P* has an acycloNMF or a ddNMP at the 3 T terminus. The P* is specific to 
the mutated template but mismatches to the wild-type template at the 3' terminus (Table 6). O - 
oltgodeoxynucleotide. PAP was performed with P*l and 01, P*2 and 02, or P*l and P*2, 
respectively. Fig. 185: PAP with 30 mer P*s: The P*s are specific for the mutated template but 
mismatch the wild-type template at their 3 P terminus. In lanes 1-8 are 3' terminal 

lc acyclonucleotide blocked P*s. In lanes 9-16 are 3' terminal dideoxynucleotidc blocked P*s for 
comparison. In lanes 1*4 and 9-12. the mutated template is used. In lanes 5-8 and 13-16, the 
wild-type template is used, The PAP product and P* are indicated with their sizes. Lane M is 
120ng of $X174-PUC]9///<wJrTDNA m&iker. Fig. 18C: PAP with 35-mex P*a; The experiment 
is the same as in Fig. 18B except with 35-mer ?*s that are 3' co-terminal with the 30-rtier P"s 

is and five nucleotides longer at their 5' termini. Fig. 18P: PAP with V&nt (exo-) polymerase. The 
experiment is the same as in Fig. 18B except that Ytxt (exo-) was used. Fig. 18E: PAP with Pfu 
(exo-) polymerase. The experiment is the same as in Fig. 19B except that Pfu (exo-) was used. 
[0103] Figure 19 abows that PAP has high selectivity to detect rare mutations in the abundance 
of the wild- type template, hi the example of nucleotide position 190, the mutation-specific P* 

2 0 matches the mutated A template but mismatches the wild-type T template at the 3' terminus. 
Specific and efficient amplification is indicated by thick arrows. When hybridized to the 
mutated A template, the P* cannot extend directly from the 3' terminal dideoxyriuclcotide, the 3' 
terminal ddTMP must be removed by pyrophosphoro lysis and the activated oligonucleotide is 
then extended efficiently. Two types of nonspecific amplification tram the T template are 

25 indicated as Types I and E The nonspecific amplification occurs rarely when mismatch 
pyrophosphorolysis occurs to generate a wild-type product that will not support efficient 
amplification as template for subsequent cycles (Type I) (the error is indicated by thin arrow and 
estimated frequency of as low as 10" 5 ). When both mismatch pyrophosphorolysis and 
misincorporetion occur extremely rarely to generate a mutated product (Type II) (the errors are 

30 indicated by thin arrows and estimated coupled frequency of 3.3* iQ* n ). Once the errors occur, 
the mutated product can be amplified exponentially in subsequent cycles and so it determines 
the selectivity. 
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(0104] Figures 20A-20B show Bi-PAP amplification. Fig. 20A: Schematic of Bi-PAP to 
detection a rare mutation. The mutation-specific assay with two mutated P* for nucleotide 190 is 
"shown. The downstream and upstream P*s contain a dideoxy T wld a didec-xy A at their 3' 
termini, respectively. They are specific for the T:A allele at nucleotide 1 90 (on the right), but are 
mismatched to the A:T wild-type ancle at their 3' termini (on the left). The P*s are 40 
nucleotides long and overlap at their 3' termini by ono nucleotide. On the left, no substantial 
product is generated from the wild-type template because of the mismatch- On the right, the 
mutated product is generated efficiently Irom the mutated template. Fig. 2 OB: Bt-PAP 
amplification directly from X DNA. Bach of the wild-type and mutation-specific Bi-PAF assays 
at nucleotide 190 was used to amplify a 79-bp segment of the lad gene from X DNAs. Foe the 
wild-type assay in lanes 1-3, the Wo wild-type P*s have 3' terminal ddA and ddT, respectively. 
For the mutation-specific assay in lanes 4-6 and lanes 7-9, the two mutated P*s ere with ddT and 
ddA at their 3' termini, respectively, hi lanes 1, 4 and 7, 2000 copies of the wild-type template 
were added to each reaction. In laces 2, 5 and 8, 2000 copies of the mutated template were 
added to each reaction. In lanes 3, 6 and 9, UO template was added. In lanes 7-9, 200 ng of 
human genomic DNA was added as carrier. The product and P* are indicated* Lime M is l20ng 
of^X174-PUCl9tffaeTT1DNA marker. 

|OI05] Figures 21A-21C show titration of template for sensitivity and selectivity of Bi-PAP. 
With the mutated P+s, the wild-type template was amplified to generate the mutated product in 
Experiment I. The mutated template was amplified to generate the mutated product in 
Experiments IL III and TV. Fig. 21 A: The mutation-specific Bi-PAP assay for A190T In 
Experiment I, the copies of the wild-type X DNA are indicated in lanes 1-5. Lane 6 is a negative 
control with uo DNA. In Experiment U, the copies of the mutated X DNA are indicated in lane 
7_j i. Lane 1 1 {0.2 copy) is a negative control to support the dilution accuracy of copy number. 
Lane 12 is a negative control with no DNA. In Experiment HI, the copies of the mutated X DNA 
in the presence of 2x10 s copies of the wild-type X DNA are indicated in lane 13-17. Lane 18 is a 
negative control with only the wild-type X DNA. In Experiment TV, the copies of the mutated X 
DNA in the presence of 100 ng ofhumati genomic DNA are indicated in lanes 19-23. Lane 24 is 
negative control only with the human genomic DNA. Lane "C WT 1 ' is the wild-type product 
control in which the wild-type P*s were used to amplify 2000 copies of the wild-type X DNA. 
Lane "C Mm" is the mutated product control in which the mutated P*s were used to amplify 
2000 copies of the mutated % DNA. The wild-type and mutated products with unique mobilities 



17 

are indicated. Fig. 21B: The mutation-specific Bi-PAP assay for T369G. Fig. 21C: The 
mutation-specific Bi-PAP assay for T369C. 

{0106] Figure 22 shows a design of P* microarray for Bi-PAP reaequencing. Bi-PAP can be 
used for resequencing to detect unknown mutations in a known region on a microarray, The P*s 
s are designed according to the wild-type template. The two opposing P*s for each Bi-PAP are 
anchored in a microarray spot. Each pair of arrows represents four Bi-PAPs for one nucleotide 
position, A mutation is indicated on the template, and it spans six overlapped P*s. On the 
microarray, many Bi-PAPs can he processed in a parallel way. 

[0107] Figures 23A-23B show a schematic of Bi-PAP resequencing. Fig. 23A: Detection of the 

10 wild-type sequence, This is a close look at the microarray. The P*s are designed according to 
the wild-type sequence. On the position of nucleotide A, four Bi-PAPs are synthesized with four 
pairs of P^s. The four downstream p*s have identical sequence, except that at the 3* terminus 
either ddAMP, ddTMP, ddGNtP or ddCMF, corresponds to the wild-type sequence and the three 
possible single base substitutions. The four corresponding upstream P*s have identical sequence, 

is except that at the 3' terminus either ddTMP. ddAMP, ddCMP or ddGMP. Eaeh pair of P*s have 
one nucleotide overlap at their 3' termini. On the next nucleotide C, another four pairs of P*s are 
synthesized (not shown). If the wild-type sample is added, only the wild-type Bi-PAPs generates 
the specific product that is labeled by fluorescence. In this way, to scan a 1 kb region, you need 
S000 P*s. Fig- 23B: Detection of an A to T mutation. On the mutated nucleotide T, the 

20 mutation-specific Bi-PAP generates the mutated product. On the next nucleotide G, no product 
of Bi-PAP is generated because each pair of P* contains one or two mismatches (not shown). 
I0JOB) Figures 24A-24B show Bi-PAP resequencing microarray. Fig. 2 4 A: Detection of the 
wild-type sequence. Four pairs of P*s are designed for each nucleotide position according to the 
wild- type sequence. Each pair of P*s are downstream and upslream directed, and have one 

25 overlap and complementary nucleotide at their 3* termini. The wild-type P* pair are specifically 
amplified on each nucleotide position. If all of the wild-type P* pairs specifically amplified, the 
wild-type sequence can be determined. Fig. 24B: Detection of the A to T mutation. With the 
mutated template, the mutation-specific Bi-PAP is amplified. There is a window of no Bi-PAP 
signals centered by the mutation-specific Bi-PAP and three successive nucleotides on each side. 

a c The paired specific subsequence is suppoued to be seven nucleotides long. Any unknown single- 
base substitution can be determined, even if it is a heterozygous mutation. AJso, small deletions 
and insertions can be detected and localized. 
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[01091 Figure 25 shows PAP de novo sequencing on microarray. PAP can also be used for de 
n^vo DNA sequencing of an unknown region. The paired specific subsequence is supposed to 
fifteen nucleotides long. F* pairs of a complete set of the paired specific subsequence are on a 
microarray with known addresses. After the unknown DNA sample is added, Bi-PAP is 
performed. All the amplified B-PAP products are collected and then the paired specific 
subsequences of the amplified P* pairs are assembled by cne-nucleotide overlapping. Thus, the 
unknown complementary sequence is reconstructed. 

(0110] Figures 26A-26C show the detection of somatic mutations. Fig. 26A: Eighteen genomic 
DNA samples of the lact transgenic mice were chosen, 2 ug genomic DNA of each sample was 
amplified with the assay B to detect the T369G mutation two times. Samples 1-10 are from 
livers of 25-month old mice. Samples 1 1-14 are from hearts (samples 11, 13 and 14) and adipose 
{sample 12) of 6-month old mice. Samples 15-1 S are from brains of 1 0-day old mice. P = 
positive control that amplified the mutated X DNA, N = ncgalive control with no DNA + + — 
amplified product, - — no product. Fig. 26B: The assay B was performed. In lanes 1 1-12, 13-16 
and 17-20, 2 v-S. 0.5 pg. and 0-125 pg of the lacf mouse genomic DNA of sample 12 were used 
in each reaction, respectively. Lanes 1-10 are controls; the copy number of the mutated A DNA 
per reaction was reconstructed by two-fold serial dilutions. In lanes 1-10 and 13-20, each 
reaction also contained I ug of the tad mouse genomic DNA carrier, ss = single-stranded, ds = 
double-stranded. Fig, 26C: The assay B was performed. In lanes 11-14, 2 pg of the lacl* 
mouse genomic DNA of sample 3 was used in each reaction. Tji lanes 15-18, 2 pg of the lact 
mouse genomic DNA of sample 9 was used in each reaction, Lanes 1-10 are controls; the copy 
number of the mutated A. DNA per reaction is indicated. Each control reaction also Contained 1 
pg of the lacT mouse genomic DNA carrier. 

DETAILED DESCRIPTION OF THE INVENTION 
[01 1 Ij The following terminology is used herein. 

[0112] Pyrophosphorolysis: removal of the 3' nucleotide from a nucleotide strand chain by 
DNA polymerase in the presence of pyrophosphate (PPi) to generate the nucleotide triphosphate. 
This is the reverse of the polymerization reaction. 

[01 13] PAP: Pyrophosphorolysis activated polymerisation, PAP can use one P* or can use two 
opposing oligonucleotides in which at least one is P*. 

[0114] P*: an oligonucleotide with a non-extendible 3' terminus (or end) that is activatable by 
pyrophosphorolysis. 
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[0115] PAP-Ai PAP-hased aUele-specific amplification that can be used for detection of rare 
mutations (Fig. 1). 

[Olltf] Bl-PAP-Ai PAP-A performed with a pair of opposing P*, i.e., bidirectional (Fig. 2) with 
at least one nucleotide overlap at their 3' termini, 
s [01171 PAP-R: PAP-based resequencing for detection of unknown mutations within a known 
sequence (Figs. 3 and 4). 

[011 8] LM-PAP: ligation-medieted PAP. The nature of LM-PAP is that the template is 

synthesized befom PAP, such as by ligation reaction or by extension using terminal transferase, 

[0119] LM-PCR: Hgation-mediatedPCR(Fig. 5). 
10 [0120] G v or A y alleles: alleles of the common polymorphism of the dopamine D| receptor gene 

that was used as a model system herein (also referred to herein as G° or A 0 alleles). 

[0121] Linear PAP: PAP with only one P* for linear product accumulation. 

[0122] Exponential PAPi PAP with two opposing oligonucleotides for exponential product 

accumulation, and at least one is P*. 
is [0123] Noise rate {%): the relative yield of the mismatched product to the matched product A 

specific signal for PA? ia defined as a noise rate of less than 1 0%. 

[0124] PASA: PGR amplification of specific alleles (also known as allele-specific PCR or 
ARMS). 

[0125] Resequencing: scanning for unknown mutations and determining the precise sequence 
2 o changes within a known sequence. Resequencing is distinguished from de novo sequencing. 
[0126] Mutation load: the frequency and pattern of somatic mutations within a tissue. 
[0127] Minimal residual disease: e.g., rare remaining cancer cells hi lymph nodes and other 
neighboring tissues or early recurrence after remission. 

[0128] NoH-ejrtendlble 3' terminus (or end): a nucleotide or nucleotide analog at the 3' 

2 5 tenuinus (or end) of oligonucleotide P* that is non -extendible but that is activatable by 

pyrophosphorolysis. Examples of a non-exttndible 3' termini (or ends) include, but are not 
limited to, a 2 , 3'-dideoxymicleotide J . an acyclonucleotide, 3'*deoxyadenosine (cordycepin), 3'- 
azido-3'-deoxythymidine (AZT), ^'-dideoxyinosine (ddl), 2\3*-dideoxy-3Mhiacytteine (3TQ 
and 2 l ,3 l -didehydro-2 1 ,3 , -dideoxythymidine (d4T). 

3 o [0129] Simplex PAP: one PAP (PAP, Bi-PAP, matched or mismatched PAP, and others) in one 

reaction tube or on a solid support. 

f0130] Multiplex PAP: more than one PAP (PAP, Bi-PAP, matched or mismatched PAP, and 
others) in one reaction tube or on a solid support, e.g., microarray. 
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[0131] Matched PAP: PAP having araatch between P* and its template. 
10132) Mismatched PAP: PAP having a mismatch between P* and its template, 
1 01 33 1 Nested PAP: PAP using two or more pairs of P* in which one pair is located inside a 
second pair on a template nucleic acid. 

[01341 Hoistart PAP: PAP in which an essential, reaction component is withheld until 
denaturation temperatures are approached {Charo et al., 1992; Kellogg et al., 1994; Mullis, 
1991; D'Aquiia el al., 1991). Essential reaction components can be withheld by, e.g, a 
neutralizing antibody bound to the polymerase, sequestering a component, such as the polymers 
or MgClj in wax, chemically modifying the polymerase to prevent activation until high 
temperature incubation or separating components by wax. 

[0135] Truncated Amplification: an amplification method which amplifies non-exponentially, 
e.g., in a quadratic or geometric manner, with over two chimeric oligonucleotides and produces 
truncated terminal products that are no more than three rounds of replication from the original 
template. (Liu et al., 2002). 

[0136J Reactive s 3 OH: is a 3' OH that is capable of being extended by a nucleic acid 
polymerase or Jigated to an oligonucleotide. 

(0137] DNA polymerases, which are critical to nucleic acid amplification, catalyze some or all 
of the following reactions: i) polymerization of deoxynuclcondc triphosphates or their analogs; 
ii) pyrophosphorolysis of duplexed 1>NA in the presence of pyrophosphate (PP), [dNMF]„ + 
x[FPi] ...tdmiP]™ + xfdNTP]; iii) ^-S'exonuclease activity (which does not require PPi), and 
iv) 5 T -3' exonuclease activity (Duetcher and Komberg, 1969; Romberg and Baker, 1992). For 
Taq and Tft DNA polymerases, polymerization and 5'-3' exonuclease activity have been reported 
(Chien et al„ 1976; Kaledin et al., 1981; Lcngley et al., 199G). For T7 Sequenase™ and Thermo 
Sequenase™ DNA polymerases, pyrophosphorolysis can lead to the degradation of specific 
dideoxymicieotide- terminated fiegmenta in Sanger sequencing reaction (Tabor and Richardson, 
1990; Vander Horn et al., 1997). 

[0138] Pyrophosphorolysis is generally of very minor significance because PP| is degraded by 
pyrophosphatase under normal physiological conditions. However, in the presence of high in 
vitro concentrations of PP { , pyrophosphorolyais can be substantial. p 0 r oligonucleotides with a 3' 
terminal dideoxy nucleotide, only pyrophosphorolysis is possible. Once the dideoxy nucleotide 
is removed, the activated oligonucleotide can be extended by polymerisation, 
[0139] Pyrophosphorolysis activated polymerization (PAP) offers a novel approach for 
retrieving ft diversity of information from nucleic adds. The exceptional specificity of PAP 
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derives from the serial coupling of two reactions. PAP involves the activation by 
pyrophosphorolysis of a 3" terminal blocked oligonucleotide (P*) followed by extension of the 
activated oligonucleotide by DNA polymerization. Operationally, PAP involves the use of an 
activatable oligonucleotide (P*) in place of a normal oligonucleotide that can be directly 
s extended. Examples of P* include an inactive dideoxy terminated oligonucleotide P* or an 
inactive chemically modified nucleotide lacking a 3 1 hydroxyl group, such as an 
acyclomicieotide, or having a non-extendible nucleotide terminated oligonucleotide P*, 
Acycloolonucleotides (acycloNTPs) in which the sugar ring is absent are known to act as chain 
terminators in DNA sequencing (Sanger et al., 1977; Trainor, 1996; Gardner and Jack, 2002). 

id The activation of P* is inhibited by mismatches throughout the length of the oligonucleotide. 
Mismatches even two nucleotides from the 5 h terminus inhibit PAP amplification. 
[0140] Activation of a P* by pyrophosphorolysis offers extraordinary Specificity throughout the 
length of P*. The enhanced specificity can be used to detect rare known mutations, to elucidate 
unknown mutations by resequencing, to determine unknown sequence by de novo sequencing, to 

15 measure gene expression levels, to compare two sequences, and to increase the specificity of in 
vivo analysis of chromatin structure. Microarray-uaeed programmable photochemical 
oligonucleotide synthesis and PAP are synergistic technologies. Thus, the enhanced specificity 
can be used for rapid, microarray* based resequencing, de novo sequencing, gene expression 
profiling and SNTP detection. 

20 [0141] A number of methods for enzymatic nucleic acid amplification in vitro have been 
developed and can be adapted to detect known sequence variants. These include polymerase 
chain reaction (PGR) (Saiki et al., 1935; Saikr et al., 1988), Ugase chain reaction (LCR) 
(Landegren, 199&; Barany, 1991) and rolling circle amplification (RCA) (Baner et al., 1993; 
Lizardi et al., 199S), Herein, we describe pyrophosphorolyaia activated polymerization (PAP), 

a s an approach that has the potential to erxhance di-amatically the specificity of PGR allele-specific 
amplification (Sommer et al., 1989). PAP differs from corrections with PGR in multiple ways: i) 
the P* oligonucleotide is blocked at the 3' terminus and must be activated by 
pyrophosphorolysis, ii) pyrophosphorolysis and polymerization are serially coupled for each 
amplification, iii) PAP may be performed with one P* for linear amplification or with two 

30 oligonucleotides for exponential amplification, iv) PPi is necessary for the amplification, v) 
significant nonspecific amplification would require the serial coupling of errors of both 
mismatch pyrophosphorolysis and misincorporation. 
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[D142] The enhanced specificity of PAP relative to PAS A is provided by serially coupling 
pyrophosphorolysis and polymerization. Significant nonspecific amplification requires 
mismatch pyrophosphorolysis and misincorporation by DNA polymerase, an extremely rare 
event. For example as described herein, DNA polymerase was utilized to detect the G allele at 
nucleotide 229 of the D| dopamine receptor gene. P+ was synthesized either with ddA, ddT, ddG 
or ddG at the 3' terminus. The 3' terminal dideoxynuclecrtide inhibits direct extension by 
polymerization, but can be removed by pyrophosphorolysis in the presence of pyrophosphate 
(PPj) when the P* is specifically hybridized wirh the complementary Strand of the G allele. The 
activated oligonucleotide can be extended by polymerization in the 5'-3' direction. 
[0143] Evidence is presented that pyrophosphorolysis followed by polymerization can be used 
to increase the specificity of P ASA. Significant nonspecific amplification with PAP requires the 
serial coupling of the two types of errors, i.e., mismatched pyrophosphorolysis and 
misinctjipuratiort (Fig. I). The rate of mismatched pyrophosphorolysis is expressed as the 
relative rates of removal of a 3' mismatch deoxynucleotide relative to the correct 3' 
deoxynucleotide. The rate of mismatch pyrophosphorolysis is less than 10" s for T7 DNA 
polymerase (Romberg and Baker, 1992; Wong et al., 1991). The misincorporation rate to create 
a substitution mutation by polymerization, expressed- as the incorporation rate of an incorrect 
versus a correct dNMP, was reported to be 10" 3 for T7 DNA polymerase and to be 10"* for E. 
coli DNA polymerase T fKomberg and Baker, 1992; Wong et al„ 1991; Bebenek et al„ 1990). 
Similar results were reported for Taq DNA polymerase, 3'- 5' exonuclease-deficient mutants of 
T7 DNA polymerase and E. coli DNA polymerase I (Komberg and Baker, 1992; Wong et al., 
1991 ; Bebsnek et al., 1990; Eckert and Kunkel, 1990). 

[0144] PAP is a method of synthesizing a desired nucleic acid strand on a nucleotide acid 
template strand. In PAP, pyrophosphorolysis and polymerization are serially coupled for nucleic 
acid amplification using pyrophosphotolysis activatable oligonucleotides (P*). P* is an 
oligonucleotide that is composed of N nucleotides or their analogs and bag a non-extendible 
nucleotide or its analog at the 3 T terminus, such as 3\5' [ dideoxynuclcotide. When substantially 
hybridized on its template strand, P* could not be extended directly from the 3' terminal 
nucleotide or its analog by DNA polymerase, the 3' terminal nucleotide Or its analog of the P* 
can be removed by pyrophosphorolysis and then the activated oligonucleotide (< N) can be 
extended on the template. 

J0143] The method comprises the following steps carried out serially. 
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[01461 Annealing to the template strand a substantially complementary activatable 
oligonucleotide P*. This activatable oligonucleotide P* has a non-extendible nucleotide or its 
analog at the 3' terminus. 

[0147] (b) Pyrophosphorolyzing the annealed activatable oligonucleotide P* with 
5 pyrophosphate and an enzyme that has pyrophosphorolysis activity. This activates 
oligonucleotide P* by removal of the 3' terminal nan- extendible nucleotide or its analog. 
]0148) (c) Polymerizing by extending the activated oligonucleotide P* on the template strand in 
the presence of nucleoside triphosphates or their analogs and a nucleic acid polymerase to 
synthesize the deaired nucleic acid strand. 
10 [0149] The PAP method can be applied to amplify a desired nucleic acid strand by the following 
additional steps, 

[0150] (d) Separating the desired nucleic acid strand of step (C) from the template strand, and 
[0151] (e) Repeating steps <A)-(D) until a desired level of amplification of the desired nucleic 
acid strand is achieved, 

is [01521 The above PAP method can be applied to allele-specific amplification. The activatable 
oligonucleotide P* has one or moTe nucleotides that are not complementary to the template 
strand. The uncomplimentary nucleotide(s) of P* may locate at the 3' terminus of P*. The above 
step of (A), (B) or (C) could not occur substantially. As the result, the desired nucleic acid strand 
is synthesized substantially less. 

2 a [0153] The above PAP method can be applied with only one activatable oligonucleotide P*. (e) 
Repeating steps <a)-(d>, a desired level of amplification of the desired nucleic acid strand may be 
achieved iinearly. The targeted nucleic acid region outside the annealing region may be t»r 
different sizes or of different sequence contexts, so the synthesized nucleic acid strands are of 
different sizes or of different sequence contexts. 

25 [0154] The above PAP method can be applied with two opposing oligonucleotides of which at 
least one is the activatable oligonucleotide P*. The activatable oligonucleotide P* and the 
second oligonucleotide are targeted for amplification of a nucleic acid region. Steps (a)-(c) occur 
to the activatable oligonucleotide P*. The second oligonucleotide ia substantially 
complementary to the other template strand. If the second oligonucleotide is another activatable 

3 0 oligonucleotide P*. steps (a>(c) occur. If the second oligonucleotide is a regular extendible 
oligonucleotide, steps (a) and (c) occur; (modified a) annealing io its template strand, followed 
by (modified c) polymerizing by extending the oligonucleotide on its template strand in the 
presence of nucleoside triphosphates or their analogs and a nucleic acid polymerase to 



WO Il3/(iy56r>4 



WO IU/U9S664 



PCTVUSOUI-tSM 



24 

synthesize the desired nucleic acid strand, (c) Repeating steps (a)-(d), or ateps (a), (c) and (d), a 
desired level of amplification of the desired nucleic acid strand may be achieved, e.g., 
'exponentially. The targeted nucleic acid region between the two annealing regions of the two 
opposing oligonucleotides may be of different sizes or of different sequence contexts, so the 
5 synthesized nucleic acid strands are of different sizes or of different sequence contexts, 

[0155] LM-PAP involves cleavage, primer extension, linker ligation and' PAP Ihat can be 
applied for analysis of in viva chromatin structure, such as, methylated state of chromosomes. 
[0156] LM-PAP may be performed by steps (i), (ii), (in), (iv) and (v), by Steps (i), (ii), (iii) and 
(vi), by steps: (ji), (iii), (iv) and (v) or by steps (ii), (iii) and (vi), where the steps are as follows. 
10 [0157J The cleavage occurs chemically, enzymatically or naturally to "breakdown" nucleic acid 
strands. The nucleic acid usually is genomic DNA that may have lesions or nicks produced in 
vivo. 

|0158J (ii) The primer of PI is gene-specific and its extension includes: 1) annealing to the 
template strand a substantially complementary primer; 2) extending the primer on the template 
15 strand in the presence of nucleoside triphosphates or their analogs and a nucleic acid 
polymerase, the extension "runs, off 1 at the cleavage site on the template strand. Steps 1) and 2) 
maybe repeated. 

[0I59i The primer extension may be replaced by a P* extension (The above PAP with only one 
activatablc oligonucleotide P+). 

30 (0160 1 (iii) The linker ligation step includes ligation of a linker to the 3' terminus of the 
synthesized nucleic acid strand. The linker ligation step may be replaced by a terminal 
transferase extension that is non-template dependent polymerization and an extra nucleic acid 
sequence is added to the 3' terminus of the synthesized nucleic acid stmnd. 
[016"1] (iv) PCR is performed with a second gene-specific primer (P2) together with a primer 

{ specific for the linker or the added sequence by terminal transferase. 

[0162] (v) A third gene-specific P* (P3) is used to detect the PCR generated fragments. PAP 
method is applied with only one activatabJe oligonucleotide P*. The extension of the activated 
oligonucleotide P+ ' runs off" at the end of the template strand generated in step (iv). The PAP 
method may be applied in allele-specifie manners. The activ&Tabl* oligonucleotide P* may 

30 contain one or more nucleotides that are not complementary to the template strand. The 
uncomplimentary mieleotide(s) of P* may iucato at the 3" terminus of P*. 
J0163J (vi) Instead of steps (iv) and (v), PAP method can be applied with two opposing 
oligonucleotide 1 ! of which at least one is the activatable oligonucleotide P*. The Hctivatable 



15 

oligonucleotide P*(P3) is gene-specific. The second oligonucleotide is specific for the linker or 
the added sequence by terminal transferase. The second oligonucleotide may be another 
activatable oligonucleotide P* or a regular primer. The PAP method may be applied in ftllele- 
specific manners. The activarable oligonucleotide P* (P3) may contain one or more nucleotides 
5 that arc not complementary to the template strand. The uncomplimentary nucleot£de(s) of P* 
may locate at die 3' terminus of P* (P3). 

[0164] Fig. I shows detection of a rare mutation by aHefe-speciric PAP (PAP-A). PAP-A can 
detect a r&re allele with extremely high specificity because an allele- specific oligonucleotide 
with a 3' dideoxy terminus (?+) permits the serial coupling of pyrophosphorolysis and 
lo polymerization. For example, if an allelcnspecific oligonucleotide has a 3' dideoxy terminus 
(P*) that matches a rare "T allele, activation occurs by pyrophosphoroJytic removal of the 
dideoxy nucleotide and js followed by polymerization (Situation A). Activation by 
pyrophosphorolysis does not normally occur with a mismatch at the 3 'terminus as with the wild- 
type "C allele (Situation B). Rarely, pyrophosphorolysis does occur at a mismatch (estimated 
is frequency 10 ,s ), but the activated oligonucleotide is extended to produce wild-type sequence 
(Situation Q. A product that supports efficient amplification is generated when mismatch 
pyrophosphorolysis occurs, a polymerase error that inserts A opposite C in template DNA 
{Situation D). The frequency of mismatch pyrophoEphoroIysis coupled with the polymerase 
mutation is estimated at lO^xlO" 6 - 3xi0" n . 
20 [0165] PAP has a potential specificity of 3x10"". Approaching this potential requires a design 
that eliminates confounding sources of error. For example, extension errors from non-blocked 
upstream oligonucleotides can generate a product with the mutation of interest If the 
misincorporation rate for TaqFS is about 10" 5 per nucleotide and only one of the three 
misincorporations generates the mutation of interest, the error rale is about 3.3x10^. Polymerases 
25 that contain a proofreading function might have an error rate per specific mutation of 3x1 0' 7 . 
Polymerases or polymerase complexes with lower error rates would improve specificity further. 
[0166] One approach utilizes linear PAP. Linear PAP-A may be performed for 40 cycles with 
only P* in the presence of a fluorescent or radiolabeled ddNTP. A labeled terminated product of 
defined size will be generated when P+ is activated. Linear PAP-A has the advantage of utilizing 
so only the original genomic DNA and eliminating error due to misincorporation from extension of 
an unblocked upstream primer. However, the sensitivity of detection is limited because the level 
of amplification is not greater than the number of cycles. For a simple genome like lambda 
phage, a detection specificity of 10 s is possible. The specificity of linear PAP-A depends 
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critically on the absence of unblocked, extendible oligonucleotides. To achieve a robust 
specificity of 10"*, unblocked extendible oligonucleotides should he present at 10' 7 , This may be 
achieved by treating gel purified P* (about 99.99% pure with our present protocol) with a 3' to 5 1 
exonuclease to degrade unhlocked molecules followed by repurification by gei electrophoresis. 
5 |0N>7[ A second approach is bidirectional PAP-A (Bi-PAP-A; Fig. 2). In Bi-PAP-A, both the 
downstream and upstream oligonucleotides are P+s that are specific for the nucleotide of 
j interest. The P*s overlap at their 3' termini by one nucleotide. This design eliminates extension 
error from a non-blocked upstream oligonucleotide. This design should not be limited by small 
amounts of active contaminating oligonucleotide to which the dideoxy taminus has not been 

10 added (about 0.01% with our current protocol) because the product generated will be that of a 
control and will not be a substrate for efficient amplification in subsequent cycles. 
[0168] Bi-PAP-A generates a product that is the size of a primer dimer. However, it is not a 
primer dimer in the conventional sense, in tbat template DNA with a mutation of interest is an 
intermediate required to generate a product that is an efficient substrate for amplification in 

i& subsequent cycles. Bidirectional PAP-A eliminates important bottlenecks to specificity and has 
the potential to reach a specificity of 1 Of*. 

[0169] As shown in Fig. 2, both the downstream and the upstream P+s are specific for the 
nucleotide of interest at the 3' ternurii (an AT base pair in this example). In the initial round of 
amplification from genomic DNA, segments of undefined size will be generated. In subsequent 

ao rounds, a segment equal to the combined lengths of the oligonucleotide minus one will be 
amplified exponentially. Nonspecific amplification occurs at lower frequencies because this 
design eliminates misincorporation error from an unblocked upstream oligonucleotide that can 
generate the A:T template from a G:C wild-type template with an error rate of 3x10"*. The P+s 
may be 30-60 nucleotides for most efficient amplification. Situation A shows that a template 

25 with a rare A:T allele will be amplified efficiently, Both the upstream and the downstream P+s 
are amplified efficiently. Situation B shows that if the DNA template contains the wild-type 
0:C sequence, neither the downstream nor the upstream P+ will be activated substantially. 

[01 70) Rapid resequencing will facilitate elucidation of genes that predispose to cancer and 
other complex diseases. The specificity of PAP lends itself to resequencing. P+s may be 

30 photochemical ly synthesized on micro arrays using flexible digital micramirror arrays. 

(0171) Microarrays of immobilized DNA or oligonucleotides can be fabricated either by in situ 
light-directed combinational synthesis or by conventional synthesis (reviewed by Ramsay, 199S; 
Marsliall find Hodgson, 1993). Massively parallel analysis can be performed. Photochemical 
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synthesis of oligonucleotides is a powerful means for combinatorial parallel synthesis of 
addressable oligonucleotide microarrays (Singh-Gfrsson et a1.„ 1999; LeProust et ah, 2000). This 
flexible alternative to a large number of photolithographic masks for each chip utilizes a 
maskiesg array synthesizer with virtual mask* generated on a computer. These virtual masks are 
relayed to a digital microtninor array. An ultraviolet image of the virtual mask is produced on 
the active surface of the glass substrate by a 1;1 reflective imaging system. The glass: substrate is 
mounted in a flow ceil reaction chamber connected to a DNA synthesizer. Cycles of 
programmed chemical coupling occur after light exposure. By repeating the procedure with 
additional virtual masks, it is possible to synthesize oligonucleotide microarreys with any 
desired sequence, The prototype developed by Singh -Gaason, et al. synthesized oligonucleotide 
microarrays containing more than 76,000 features measuring 16 square microns. 
T01721 By combining programmable photochemical oligonucleotide synthesis with digital 
mirrors and oligonucleotide extension of F+, a high throughput and automated method of 
resequencing is possible. PAP-R may detect virtually 100% of single base substitutions and 
other small sequence variants because of its high redundancy; the mismatch spanned by the 
several overlapping P* oligonucleotides will prevent activation of a cluster of overlapping P*s. 
One strategy for resequencing is shown in Figs. 3 and 4. Fig. 3 shows a schematic of PAP-R 
performed on a microarray with programmable photochemical oligonucleotide: PAP can be used 
for resequencing to detect unknown mutations. On this microarray, the wild-type template is 
indicated. The P+s are designed according to the wild-type template. The P"s that overlap with 
the mutation generate little or no signal indicated as "Low" PAP signal. 

[0173] Fig. 4 shows an example of solid support-based, eg., mictoarray-b&sxd, resequencing to 
detect a G to A mutation. Linear PAP is performed with four different dye-labeled ddNTPs as 
substrates for single-base extensions. ?*& have a specific region of 16 nucleotides within the 3' 
region of the oligonucleotide. Homozygous or hemizygous DNA template is utilized in the 
example. Sets of four P*s> with identical sequence except for the four ddNMPs at the 3" 
terminus > are synthesized for each nucleotide position on the sense strand of the wild-type 
sequence. The P* with a ddA at the 3' terminus generates a PAP signal at the site of the G-A 
mutation. The mutation also creates a 15 base "gap" of no PAP signal for the subsequent 
overlapping 15 sets of P+s. For heterozygous mutation, the P+s with ddA and ddG provide PAP 
signals. The heterozygous mutation also generates the 15-base "gap" of 50% signal intensity 
(which is flanked by signal Of 100% intensity). For added redundancy with heterozygotes 
samples, antisense P+s can be utilized (not shown). An unknown single-base substitution Can be 
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determined by combination of the two sets of P*3. Small deletions and insertions con be detected 
and localized. 

"[0174] With 100,000 oligonucleotides per microairay, about 12 kb can be resequenccd from 
downstream and upstream directions. The detection of virtually all mutations requires 
supplementation of the standard Geniom® instrument software, For wild-type sequence, the 
signal intensities may vary. Certain oligonucleotides will generate a weaker signal due to 
secondary structure and other factors. The pattern of signal from wild-type samples should be 
distinguished reliably from the pattern generated by a given sequence change. The preliminary 
data suggest that almost all mismatches will inhibit activation dramatically. Because of the 
redundancy, mutations may be reliably distinguished from the wild-type even if a significant 
minority of single base mismatches does not inhibit activation substantially. 
10175 J It is becoming increasingly apparent that in vivo chromatin structure is crucial for 
mammalian gene regulation and development. Stable changes in chromatin structure often 
involve changes in methyiation and/or changes in histone acetylation. Somatically heritable 
changes in chromatin structure are commonly called epigenetic changes fRusso and Riggs* 
1 996) and it is now clear that epigenetic "mistakes" or epimutations are frequently an important 
contributing factor to the development of cancer {Jones and Laird, 1999). 
[0I76J One of the few methods for assaying in vivo chromatin structure, and the only method 
wilh resolution at the single nucleotide level, is ligaticn-mediated PCR {LM-PCR) (Mueller and 
Wold, 1939; Pfeifer et al., 19S9). LM-PCR has been used to assess chromatin structure, 

1 methyiation and damaged DNA. Fig. 5 shows a schematic of LM-PCR in which a DNA lesion 
in the starting DNA is indicated by a small diamond. LM-PCR involves cleavage, primer 
extension, linker ligation and PCR amplification. LM-PAP is similar to LM-PCR except that 
activatable oligonucleotide P*s are used. 

[0177] LM-PCR has proven to be an important technique, now having been used in over 100 
published studies (Pfeifer et al., 1999). Many aspects of chromatin siructure can be determined 
by LM-PCR, such as the location of methylated cytosine residues, bound transcription factors, 
or positioned nucleosomes. Importantly, the structure is determined in cells that are intact and 
have been minimally perturbed. UV photo-footprmting, for example, is performed by UV 
irradiating tissue culture cells in a Petri dish, immediately extracting the DNA, and performing 
LM-PCR to determine the location of thymidine dimers, the formation of which is affected by 
bound transcription factors. 



29 

[0178] Allele-specific LM-PAP can be applied to quantitatively determine the level of in vivo 
methyiation. The background of LM-PCR currently limits reliable estimation of the level of 
methyiation. It is generally considered that 0 %, 50 % and 100% methyiation can be determined, 
but distmguishirtg finer gradations is not reliable. With a marked reduction in background in 
s LM-PAP, 0%, 20%, 40% 60% 80%, and 100% methyiation standards may be distinguished 
reliably. It will be of particular interest to utili7.e allele-specific LM-PAP to examine die level of 
methyiation in imprinted regions, or in active verses inactive X-chromosomal genes in females. 
It is anticipated that LM-PAP will decrease the skill and experience needed to examine 
chromatin structure, thereby facilitating analysis of chromatin structure by more laboratories. 
10 [0179] LM-PAP has a diversity of applications. It will be of particular interest to utilize allele- 
specific PAP to examine differential methyiation and chromatin structure in imprinted genes or 
in active versus inactive X chromosomal genes in females. In addition, the relationship between 
mutagens, DNA damage, and mutagenesis can be examined. 

[0180] In PAP, as described above and illustrated herein, pyrophosphorolysis and 
is polymerization by DNA polymerase are coupled serially by using pyrophosphorolysis 
activatable oligonucleotide. In PAP sequencing, the principle is based on the specificity of PAP 
and in turn on the base pairing specificity of the 3' specific subsequence. This property of the 3 T 
specific subsequence can be applied to scan for unknown sequence variants, to determine de 
novo DNA sequence, to compare two DNA sequences and to monitor gene expression profiling. 
z.0 [01811 PAP is highly sensitive to mismatches along the length of P* in PAP with one P+ and 
one opposing unclocked oligonucleotide. The specificity of PAP is also affected by P* length 
and mismatch. If the ailele-specific nucleotide of P* is at the 3' terminus, only the specific allele 
is amplified and the specificity is not associated with P* length. If the allele-specific nucleotide 
is not at the 3' terminus of P+, the specificity is associated with P* length. 26 mer P* has a 3' 
as specific subsequence of three-nucleotides within this region any mismatch inhibit the 
amplification. 1 S-mer has a 3 ' specific subsequence of 1 6 nucleotides. 

[0182[ BUPAP is a form of PAP. In Bi-PAP with two opposing P*s, each P* has Its own 3' 
subsequence, i.e., within this region any mismatch inhibit the amplification of Bi-PAP. For 
example, when the allele-specific nucleotide of the P* pair is at their 3' termini, only the specific 
3 0 allele was amplified, no matter what the lengths of the P^s are 40, 35 or 30 nucleotides. The 
length of the paired specific subsequence is addition of the P* pair minus one. 
[0183] The length of the paired specific subsequence may be affected by the sequence context 
and siite of each P*> the type of the 3' terminal non-extendibie nucleotide, the template sequence^ 
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the DNA polymerase, other components liks ions, and cycling conditions. When the template 
contains repeated sequences or homogenous polymer runs longer than the length of the P+ pair, 
P* may lose specificity for anchoring , 

[01841 Resequencing is the sequencing of a known region to detect unknown mutations, The 
property of the paired specific subsequence of Bi-PAP can be applied to scanning for unknown 
sequence variants or re-sequencing of predetermined sequences in a parallel way. 
[0185] A Bi-PAP resequencing is shown in Pigs. 22, 23 A, 23B, 24A and 23B-. Briefly, the wild- 
type sequence can be determined, and any single base substitution can be determined with the 
type and position. An unknown small deletion and insertion can he detected and localized. In 
order to identify a specific type of deletion or insertion, it is possible to add corresponding Bi- 
PAPs. For fingerprinting, which can provide information regarding mutation position, fewer 
numbers of Bi-PAPs can be used. 

10186] The concept of Bi-PAP de novo DNA sequencing mafcea use of the complete set of 
paired specific subsequence of the P* pair to identify the presence of the paired specific 
subsequence in the de novo sequence. 

[0187] Bi-PAP de novo DNA sequencing on mieroanay is shown in Fig. 25. Briefly^ the 
procedure first collects all the Bi-PAP amplifications with their P* pairs and then reconstructs 
the unknown DNA sequence from this collection by ordering the paired specific subsequences. 
[QJ881 For comparison of two DNA sequences to see if they are the same or different, there is a 
simple way to reduce the number of P* pahs by using an incomplete set of the specific 
subsequences of the P* pair. By arranging them in a particular order, it Is possible to identify 
the chromosomal locations as well as tbc sequences. 

[01891 To monitor gene expression profiling, where up to 6x10* to 10 5 transcripts are expressed 
and details of the precise sequence are unnecessary, Bi-PAP can be applied. A set of P* pairs 
which can specifically amplify unique motifs in genes can be designed for Bi-PAP. 
[0190] This property of the base pairing specificity of Bi-PAP can be applied to scan for 
unknown sequence variants, to determine de novo DNA sequence, to compare two DNA 
sequences and to monitor gene expression profiling. A Bi-PAP array is possible. Each pair of 
two opposing P*s can be immobilized at an individual spot on a solid support, e.g., microarray, 
thus allowing all the Bi-PAP reactions to be processed in parallel. 

[0191) For PAP, the atidvatable oligonucleotide has a non-ejttendible 3' terminus thsit is 
activatable by pyrophosphorolysis (hereinafter referred to as a non-extcndible y terminus). Any 
3' terminal non-extendible oligonucleotide can be used, if it can hybridize with the template 
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atraud, the 3' tc:rminal nucleotide can be removed by pyrophosphorolysis, and the activated 
oligonucleotide can be extended. Examples of non-extendible ? terminus include, but are not 
limited to, a non-extendibie 3* deoxynucleotide, such as a dideoxymicleotide, or a chemically 
modified nucleotide lacking the 3' hydroxy! group, such as an acyclonucleotide. 
5 Acyclonucleotides substitute a 2-hydroxyethoxymethyl group for the 2'-deoxyribofuranosyl 
sugar normally present in dNMPs. 

[0192] Alternative blocking agents may increase the selectivity of pyrophosphoroloysis for a 
complete match, thereby further enhancing the selectivity of PAP for detecting rare mutations. 
Finally, alternative blocking agents may be less expensive or more readily automatable, thereby 

10 improving the coat-effectiveness of PAP and facilitating PAP microarray-based resequencing. 

[0193] In addition, P*a not blocked with dideoxynucleotides extends the selection of DNA 
polymerases which can be used for PAP. As demonstrated herein, Family I polymerases may be 
used for PAP when the 3' terminal non-extendiblc oligonucleotide contains a di deoxynucleotide 
or an acyclonucleotide. Family O polymerases may be used for PAP when the 3' terminal non- 

15 extendible oligonucleotide contains an acyclonucleotide 

EXAMPLES 

[0194] The invention can be understood ftoni the following Examples, which illustrate that PAP 
can be used to identify a known mutation in a polymorphic site within the human D, dopamine 

20 receptor gene. The effects of the dideoxyoligonuclcotidc sequences, acyclonucleotide 
sequences, DNA polymerases, Pp ( concentrations, allele-specific templates, pH, and dNTP 
concentrations were examined. The experlmenta reported iu the Examples were conducted for 
proof of principle, The following examples are offered by way of illustration and are not 
intended to limit the invention in any manner. Standard techniques well known in the art or the 

25 techniques specifically described therein were utilized, 

EXAMPLE 1 
Preparation of template by PCR 
[0195] A 640-bp region of the human Dj dopamine receptor gene was amplified by PCR with 
30 two primers (T - 5' GAC CTG CAG CAA GGG AGT CAG AAG 3' (SEQ ID NO:l) and U - 5' 
TCA TAC CGG AAA GGG CTG GAG ATA 3' (SEQ ID NO:2)) (Fig. 6A). The TU:UT 
duplexed product spans nucleotides 33 to 672 m GeoBank X55760 and the G+C content is 
55.3%. A common A to G polymorphism is located at nucleotide 229, resulting hi Ihree 
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genotypes of G/G, AJ A and G/A (Liu et al, 1 995). The PCR mixture contains a volume of 50 
ul: 50 mM KC1, lOmM Tris/HCl, pH 8.3, 1.5 mM MgCla, 200 uM each of the four dNTPs 
{Boehringer Mannheim), 0.1 uM of each primer, 2% DMSO. 1 U of Tag DNA polymerase 
(Boehringer Mannheim) and 250 ng of genomic DNA from G/C homozygote, A/A homozygote 
s or G/A heterozygotes. Cycling conditions included: denaturation at 95 U C for 15 seconds, 
armeplmg at 55 °C far 30 seconds, and elongation at 72°C for one minute, for a total of 35 cycles 
(Perkin-Elmer GeneAmp PGR system 960O), The PGR product was purified from primers and 
other small molecules by approximately 10,000-fold by three times of retention on a Centricon® 
100 microconcentrator (Amicon). The amount of recovered PGR product was determined by 
10 UV absorbance at 260 nm. 

Synthesis of P* by adding a 3'-didcoxymjcleotide 
[01961 The deoxynucleoiide oligonucleotide was synthesized by Perceptive Bioayatems 8909 
Synthesizer (Framinsham) and purified by oligopure cartridges (Hamilton) in the City of HopB 
DNA/RNA Chemistry Laboratory. The 3' terminal dldeoxynndeotide was added by terminal 
is transferase. The mixture contained a total volume of 40 pi: 200 mM potassium cacodyiatc, 25 
mM Tris/HCl (pH 6.6 at 25 °C), 2.5 mM CoCk, 0.25 mg/ml of BSA, 4000 pM of the 
oligonucleotide, 2.5mM 2'3'-ddNTP (the molar ratio of the 3'-OH terminus to ddNTP was 1:25) 
Boehringer Mannheim), 125 U of terminal transferase (Boehringer Mannheim), The reaction 
was incubated at 37°C for 1 hour and then stopped by adding EDTA at 5 mM final 
20 concentration. After desalting by using butane], the dideoxyoligonucleotide was purified by 
preparative ?M urea/20% polyacryl amide gel electrophoresis in TBE buffer {90 mM Tris/boFate, 
ImM EDTA, pH 8.3) {Maniatis et aL, 1982). The amount of the recovered P* was determined 
by UV absorbance at 260 nm. 

[A197J Since small amounts of untenninated oligonucleotide would result in non-specificity of 
pyrophosphorolysis, each dideoxyoligouuckotide was 3J P -labeled at the 5' terminus by T4 
polynucleotide kinase and theti was elcetrophoresed through a 7M urea/20% polyacrylamide gel. 
Only P* products were visible even when the gel was overexposed (data not shown). It is 
estimated thai more than 99.99% of P+ contained a dideoxynucleotide at the 3' terminus. 
Pyrophosphorolysis activated polymerization 
30 [0198] A 469-bp region within the TU:TJT duplexed template was amplified by PAP with 
oligonucleotides P* and U, or with only one P* (Table 1 and Fig. 6 A). The PUrUP duplexed 
product corresponds to nucleotides 204 to 672 in GenBank X55760 and the G+C content is 
55.6%. Unless stated, the PAP reaction mixture conlauted a total volume of 25 uj for Tfl DNA 
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polymerase: 75 mM KC1, 20 mM Tris/HCl {pH 7.4), 1,5 mM MgCfe, 40 uM each of the four 
DNTPs (dATP, dTTP, dGTP and dCTP), 0.2 uM P*. 0.05 uM U oligonucleotide, 300 uM 
Na^PPj (the 20 MM stock solution was adjusted by HCI to pH 8,0), 1 uCi of [a-"P]-dCTP 
(SOOOCi/nmole, Amersham), 1 U of Tfl DNA polymerase (Promega) and 2 ng of TU:UT. For 
5 Tag DNA polymerase, the reaction mixture was the same except for 50 mM KC1, 10 mM 
Tris/HCl (pH 7.4), 2.0 mM MgCl 2 and 1 U of Tag DNA polymerase {Boehringer Maiinheim). 
The mixtures of PGR and other controls were the same except for the primers added. Cycling 
conditions included: 94 Q C for 15 seconds, 55 "C for one minute, ramping to 72 "C for one 
minute and 72 °C for two minutes, for a total of 15 cycles. 

10 

l'ABLE 1 
Oligonucleotides used in PAP 



Tem- 
plate 


a 

5 ' . , . PA T GTG.RC T G AC CC CTfi TTCCCTGCTT GGAAC. . . 3 ' (SEQ ID NO; 3 } 
A 


Name 


Oligonucleotide sequence S'-3' (SEQ ID NO:) 


Purpose 




ACT GACCCCTJVT TCCCTGCTT 1 " (4) 


Control 


DiG' 1 


AC T G ACCCCTATTCCC T G CTTG ' " [S} 


V ddG and G allele specificity 
ce-localiied 


D 2 G 


ACTGAC CCC I At TCCCTG Clt GG * (6) 


G allele specificity 5* to ddG 


D 4 G" 


ACTGACCCCTATTCCCTGCTTGGG* (7) 


G allele specificity 5' to ddG 


D<G' 


ACTG^CCCCTATTCCCTGCTTGGGG* (81 


3' ddG mismatches template 




TCTGACTGACCCCTATTCCCTGCTTG* ■( 9-) 


D,Q", with 5' extended bases 


D 5 A' 


TGACTGACCCCIAIICCCtGCtTA* <1CJ 


3' ddA md A allelc-epeciflcity 
co -localized 


U 


T C ATACOG C AAAG Q GCTGG AG ATA (11) 


Upstream oligonucleotide 


Name 


i' terminal 
nucleotide 0 


Alkie-specific 
nucleotide 4 


(bast) 




Amplification' 


Type 


Msteh 


Type 


From 3' 
termiTOift (bp) 


G allele 


A allele 


D, 


dT 


Yes 




+1 


21 


64 


Yes 


Yes 


D,G' 


ddG 


Yes 


G 


0 


22 


68 


No 


No 




ddG 


Yes 


G 


-1 


23 


72 


No 


No 




ddO 


Yes 


O 


-2 


24 


76 


Yes 


No 


D<G" 


ddG 


No 


G 


-3 


25 


30 


No 


No 




ddG 


Yes 


G 


0 


26 


so 


Yes 


No 




ddA 


Yes 


A 


0 


24 


72 ' 1 


No 


No | 
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|!T~ I OA j Yss | - | ■ 1 24 [72 | Yes j Yea | 

* D|G" was produced by adding a G dideoxymreleotide to the 3' termmua of the d, a 
dideoxyflutleotide at the 3' terminus. 

L The T means the 3 T terminus k T deojtypucleotide and G* means the 3' terminus is G 
dideoxynudeotide. The bold capital O and A are the G and A basea corresponding to Q and A 
s alleles, respectively. The first base at the 5' terminus corresponds to nucleotide 208 in GenBank 
X5S760. 

c The 3 h terminal base is a deoxynucleotide or dideoxyoucleotide, and creates a match (Yes) or a 
mismatch (No) with the corresponding base on the complementary strand of the template. 

d The Hllcle-<tpecific nucleotide is G or A and its distance to the 3' terminus is assigned; 0 - at the 
3' terminus +1 - one base downstream from the 3' tenninua, -1 =• one basic upatream from the 
3' terminus, -2 = two bases upstream from the 3' terminus, and -3 = three bases upstream from the 
3' terminus. 

*The Tm for oligonucleotides whb estimated to be 4°CX (G + C) + 2°CX (T + A) at I M NaCl 
(Miyada and Wallace, 19S7). 
15 ' The amplification with U and one P* or with only one P*. 

[0199] The reaction was electrophoresed through a standard 2% agarose gel. The gel was 
stained with ethidium bromide for UV photography by a CCD camera (Bio-Rad Gel Doc 1000), 
dried and subjected to Kodak X-OMAT™ AR film for autoradiography. 

so Restriction digestion 

|0200] Each of the three restriction endonucJeascs of AciL (^C^CGCS'/B'GGC^G^) Ectel 
(FPy'GGCCPirtV 3 , PuCCGG^Py5'> and EcoQ\09l (5-PuG*GNCCPy3V 3'PyCCNG*GPu5') has 
a restriction site within the PU:UP duplex. The G/G alleles were amplified by PAP with D 3 G* 
and U; PCR ampJification with D| and U was uaed as the control. 40 ul of the PAP reaction and 

25 2 j,d of the PCR reaction were purified and concentrated with a Centricon® 100 
microconoeotrator, and the products digested by the restriction etidonuciease: 2.5 U of Acil m 
IX NE buffer 3; or 3 U of Eael in IX NE buffer I; or 30 U of EcoQl09l in NE buffer 4 with 
BSA (all of the above enzymes and buffers from New England BioLabs). 10 ul of the reaction 
was incubated at 37 °C for 2 hours. The digestion reaction was electrophoresed through a 

3 o standard 2% agarose gel 33 described above, 
Principle of PAP 

f0201] J/? and Tag DNA polymerases were shown to contain pyrophosphorolysis activity, Tfl 
£>NA polymerase was utilized to detect the G allele at nucleotide 229 of Ihe D] dopamine 
receptor gene {Liu et al., 1995) (Fig. 6A). P+ was synthesized with m'ther ddG or ddA at the 
35 ^'terminus (see Table 1). The ^terminal dideoxynodeotide inhibits direct extension by 
polymerization, but can be removed by pyrophosphorolysis in Ihe presence of pyrophosphate 
(PPj) when the P* is specifically hybridized with the complementary strand of the G allele. The 
degraded oligonucleotide can he extended by polymerization in 5'-3'dhection (Figs> 6B and 6C), 
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[0202] The enhanced specificity of PAP relative to PASA is provided by 3eriaily coupling 
pyrophosphorolysis and polymerization. Significant nonspecific amplification requires 
mismatch pyropbosphoroiysis and misincorpcraticn by DNA polymerase, an extremely rare 
event (Fig. 7). 
5 Specific amplification with D5G* and D3G* 

[0203] PAP was performed with two oligonucleotides (P* and U), Tfl DNA polymerase and 
DNA template of the G/G and A/A alleles. Multiple P* were tested (Table 1). DjG* (the allele- 
specific nucleotide and dideoxynucleotide are co-localized ta the 3* terminus and D 3 G* (the 
allele-specific nucleotide h two base* from the 3' terminus) specifically amplified the G allele ul 

10 the presence of PP f (Fig. 8A). Without added PPj, no specific product was observed with DjG 41 , 
indicating that added PP( was an essential component for PAP (Fig. SB, lanes 6 and 15). Faint 
products with D^G* in lane 4 and with D4G* in lane 5 were observed (Fig. 8B) {see below). 

Effects of pH, [PP|] and [dNTP] and enzyme 
J0204] Each of the above parameters was examined. PAP was moat efficient at pH between 7.4 

is and 7.7, at [PP { ] between 200 uM and 400 uM, and at [dNTPsJ between 25 \tM and 50 ufvf 
(Table 2). Tag DNA polymerase can substitute for Tfl with similar efficiencies (Table 2). 



TABLE 2 
Parameters affecting PAP 





Parameter 


PAP efficiency* 






D 5 G*-U 


DjG*-U 


pH* 


8.1 






7.9 








7.7 


-H- 










+++ 




7.4 


++ 


+++ 




7,15 


+ 


+ 


PPi' 

OlM) 


1000 






BOO 




+ 


600 




+4 




400 


-H- 


+++ 




200 


++ 


+++ 




0 




± 


All dNTPs 
changed" 


200 




+ 


100 




± 



WO ltf/095M4 PCT/IIS0J/US5I; 
36 





50 \ '++ 


+++ 


25 


++ 


-H \ \: 


dGTP 
changed lc 


100 




++ 


50 


+ 


++ 


25 


± 


++ 


dATP 
changed"* 


100 




+ 


SO 




+ 


25 




+4- 


Tag DNA 
polymerase 


G allele and PP ( 




+++ 


A allele and PPj 






G allele and no PP; 




+ 


" Tfl DNA polymerase was used to ampii: 


y the G/G alleles under the conditions in 



Materials and Methods, except for the factors indicated 

* The PAP efficiency is indicated as: -, no specific produces); +, very weak specific 
produces); +, weak specific produces); ++, moderate specific produces); -H-f , strong 
5 specific produces); +-H-+ h very strong specific produces), 

c The indicated concentration was changed hut the others were kept at 200 pM. 

Identity of specific products 
1020S] In order to confirm the identity of the specific products, restriction endonuclease 
io digestion was performed (Fig. 9). Each of the three restriction endcnucleasea of Acii, Eael and 
£co0109 has a restriction site with the PU:UP duplex. The expected restriction fragments were 
found, Similar results were observed with D 3 G* and U. 

[0206] The specific products of PAP with D 3 G* and U revealed two specific bands on the 
agarose gel, l.e., PU:UP and UP; because U was more efficient than B s G* r under our 

i s amplification conditions, In order to confirm this, the G/G alleles were amplified by PAP using 
Tfl DNA polymerase with DjG* and U as previously. The products were denatured and 
electrophoresed through a denaturing polyacrylamide gel. Only one specific band in single- 
stranded form was observed, indicating that the specific PAP products: contain the duplexed and 
single stranded segments. The same result was observed with D^G* and U. 

20 Linear PAP 

[0207] PAP was performed for linear amplification with only one F* from the G/G and A/A 
alleles in the presence of PPj. The specific products of PAP were obtained with DjG* and with 
DjG*, but not with the other P* (Fig, 10, lanes 4 and 6). The efficiency of P+ was afTected by 
the oligonucleotide size, the 3'-tenninal dideoxynucleotide and the position of the allele-specific 

35 nucleotide. 
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[0208] Figs. 6A-6C show schematic of PAP. Fig. 6A. A duplexed DNA template TU:UT is 
amplified with two oligonucleotides P* and U, Tfl DNA polymerase, dl^JTPs, pyrophosphate and 
[a- 3J P]-dCTP. P* - pyrophosphorolysis activatable oligonucleotide, In this example P* is 
D5G* and TU:UT is a 6"40-bp segment of the dopamine Dt receptor gene, Fig. 6B. DjG* has a 
s G dideoxynucleotide at the 3' terminus, and it is specific to the complementary strand of the G 
allele, but mismatches the A allele at the 3' terminus (Table 1). Removal of the dideoxy G by 
pyrophosphorolysis is followed by polymerization for each amplification, Fig. 6C. 
Autoradiogram of PAP from the GIG, A/A and G/A genotypes. When the G allele is present, 
the radioactively labeled specific products of 469 bases (duplex PU:UP and excess antisense 

io strand UP) are produced, since the low rate of pyrophosphorolysis by Tfl polymerase implies 
that oligonucleotide U has a much higher efficiency than oligonucleotide P*. Electrophoresis 
for a longer period separates PUrUP from UP. Other products of UT and UT:TU are indicated. 
Note that TU:UT derives from annealing of excess r&dioactivcly labeled UT with non- 
radioactively labeled TU original template. PAP was also performed with D3G+ and U from the 

is G/G, A/A and G/A genotypes, and similar results were obtained. 

[0209] Figs. 7A-7B show enhanced specificity of PAP with D A G*. The specificity of PAP is 
compared with that of PASA to exponentially amplify a template pool of G and A alleles, Fig. 
7A. The specific amplification of PASA derives from the high efficiency of primer extension 
when the primer matches the G allele. The nonspecific amplification results from mismatch 

20 extension from the A allele. When this occurs, it results in an efficiency substrate for further 
amplification. The thickness! and, position of the arrow represent the amplification efficiency in 
each cyele. Fig. 7B. The specific amplification of PAP from the G allele occurs at high 
efficiency. Two types of nonspecific amplifications originate from the A allele: (i) nonspecific 
amplification can occur at low efficiency by mismatch pyrophosphorolysis resulting in a A:T 

25 homo-duplex PU:UP product, which is not an efficient template for subsequent amplification; 
(ii) nonspecific amplification can occur at extremely low efficiency by both mismatch 
pyrophosphorolysis and misincorporation to produce a G:T hetero-duplex PU:UP product, but 
once it occurs, it provides an efficiency template for subsequent amplification. A similar 
tendency of nonspecific amplifications is suggested for linear amplification by PAP with oniy 

3 a DjG*. It should be noted that allels-specific nucleotide of P*, such as DjG*, may be near but 
not at the 3' terminus. In that case nonspecific amplification of PAP requires both mismatch 
pyrophosphorolysis and mismatch extension. While both variations of PAP should have higher 



WO 03rtiy56W 



PCT/USim455G 



WO (KJ/Q95&fiJ 



38 

specificity than PASA, the highest specificity is predicted when the 3' terminal dideoxy 
nucleotide is also the allele-specific nucleotide. 

10210] Figs, 5A-8B show specific amplification with D 3 G* and D 3 G*. PAP was performed in 
the presence (Fig. 8A) dt absence (Fig. &B) of added PPj with two oligonucleotides for 
5 exponential amplification. The oligonucleotides are listed in Table 1. Extension controls with 
only U identify the positions of TU:UT and UT. Extension controls with D, identify the 
position of P(J. PCR controls of D| and U identify the positions of PU:UP and PU:UT. Only 
20% of the extension reaction with D\ and the PCR reaction were loaded relative to other lanes. 
[0211] Fig. 9 shows restriction endonucles.se digestion. To show specificity of PAP, samples 

10 from the experiment shown in Fig. 8 were digested with Acii, Eael and £eoQlQ9I restriction 
eudonucleases, Each enzyme has a restriction site within PU:UP. PAP amplified the G/G 
alleles with D3G* and U, and 5% of PCR reaction with Di and U were taken as control. Acii 
produces a 236 bp and a 233 bp fragments from PU;UP and a 4-07 bp and a 233 bp fragments 
from TU:UT. Sael produces a 289 bp and a i 80 bp fragments from PU:UP and a 460 bp and a 

15 ISO bp fragments from TU:UT. £co0109I produces a 348 bp and a 121 bp fragments from 
PU:UP and a 107 bp, a 412 bp and a 121 bp fragments from TU:UT. The arrows indicate the 
digestion products expected from PU:UP. 

[0212] Fig, 10 shows linear PAP^ PAP was performed with only one P* in the presence of 
added PP,. 20% of the. reaction with D| was loaded relative to other lanes (lanes I and 10). No 

20 ^ no oligonucleotide added. 

Enhanced specificity of PAP with D 5 G* 
[0213] Example I provides evidence that pyrophosphorolysis followed by polymerization may 
be used to increase the specificity of PASA. Significant nonspecific amplification requires the 
aerial coupling of the two types of errors (Fig. 7). The mismatch pyrophosphorolysis rate to 

25 remove a mismatch deoxynucleotide at the 3' terminus, expressed as the removal rata of an 
incorrect versus a correct dNMP, was reported at less than 10* S for T7 DNA polymerase 
(Kornberg and Baker, 1992; Wong et aL, 1991). The misincorporation rate to create a 
substitution mutation by polymerization, expressed as the incorporation rate of an incorrect 
versus a correct dNMP, was reported as to be 10"* for T7 DNA polymerase and to be 10** for 

30 Kcoli DNA polymerase I (Kornberg and Baker, 1992; Wong et rd., 1991; Bebenek et al., 1990), 
Simitar results were reported for Taq DNA polymerase and for 3-5' exonucleasc-defieient 
mutants of T7 DNA polymerase arid E. call DNA polymerase I (Kornberg and Baker, 1992; 
Wong et ah, 199 U Eckcrt and Kunkel, 1990). The specificity due to the (i) nonspecific 



amplification in PAP with D5G* is estimated to be 10' 5 per cycle, if the mismatch 
pyrophosphorolysis rate of a ddNMP is die same as dNMP. The specificity due to the (ii) 
nonspecific amplification is estimated to be 3.3x10"", if the mismatch pyrophosphorolysis and 
the misincorporation are serially coupled. 
5 Essential components of PAP 

10214] Each P* was tested by utilizing Tfl or Taq DNA polymerases to amplify the G/G and 
A/A alleles. The specific amplification requires the presence of PPj and allele-specific template, 
In addition, the amplification efficiency is affected by the oligonucleotide size, the 3' terminal 
dideoxynucleotide, the position of the allele-specific nucleotide relative to the 3' terminus of P*. 

10 [0215] It is not clear why D|G* and D2G 1 * did not generate the specific signals, hut it may be 
related to a threshold stability of duplex between P* and the template. Ds A*, which contains A 
dideoxynucleotide at the 3 f terminus, did not generate the specific signal* which may be 
associated with different incorporation efTicianoiea of doNTPs by polymerization. Klenow 
fragment of B. colt DNA polymerase I, Taq DNA polymerase and aTaq DNA polymerase 

is incorporate ddGTP more efficiently than other ddNTPs (Sanger et al., 1977; Tabor and 
Richardson, 1995; Vander Horn et al., 1997). The rate of ddNTP incorporation also varies 
depending on the template sequencs and can be 1 0-fold higher at some bases relative to others 
(Sanger ct al., 1977). Another possibility is that D&A* is shorter in size with a lower T trl . 
[0216] In PAP without added PPj, very faint false signals were generated with DjG* and with 

2 0 D4G* (Fig. 8B). One possibility is that oligonucleotide dimers can form and trigger nonspecific 
pyrophosphorolysis of P* in later cycles after "endo-" PPj is released from the by-polymerization 
to generate UT. 3'tftnrnnal degraded DiG* and D 4 G* can be hybridized and extended as false 
signal. Oligonucleotide dimers were observed with D 3 G* and D4G*, Another possibility with 
DjG* is that the specific pyrophosphorolysis can occur in later cycles after "endo-" PP t is 

2S released. A third possibility is that D 3 G* and TJ4G+ were contaminated by minimal D 3 and D 4 
which were not fully added by G dideoxynucleotide at 3' termini. 

Comparison with other technologies 
[0217] A number of methods for enzymatic nucleic acid amplification in vitro have been 
developed and can be adapted to detect known sequence variants. These include polymerase 

30 eh&in reaction (PCR) (Saiki ct al., 19S5; Saiki et aL, 1988), ligase chain reaction (LCR) 
(Landegren et al., 1988; Barany, 1991) and rolling circle amplification (RCA) (Lizardi et al., 
1998; Baner et al., 1998). PAP is different in many ways: i) pyrophosphorolysis and 
polymerization are serially coupled for each amplification, ii) there is at least one 
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dideoxyoligonucleotide for PAP. Other chemically modified nucleotides lacking the 3 T - 
hydroxyl group at the 3' terminus, such as acyclonucleotides can serve the same function (see 
Example 12 below), iii) one formal is for linear amplification and the other is for exponential 
amplification, iv) ?P; is necessary for the amplification, v) significant nonspecific amplification 
requires both mismatch pyrophosphorolysis and misincorporation, vi) PAP can detect known 
point mutations and greatly increase the specificity to detect an extremely rare mutant allele 
from the wild-type allele. 

10218] The mechanistic basis is that two or more reactions are serially coupled for amplification 
with increased specificity. The key component of PAP is a pyrophosphorolysis activatable 
oligonucleotide. The blocked 3' terminus in these experiments is a dideoxy nucleotide, but any 
non-extcndible nucleotide susceptible to pyrophoaphorplysis could in principle be substituted. 
Indeed, any enzyme that cleaves an oligonucleotide 5' to a mismatch could serve the same 
function as pyropliosphoro lysis activation. For example, a blocked oligonucleotide including the 
methylated recognition sequence (such as G m ATC) is annealed to its target with the 
unmethylated recognition sequence, then restriction endonuclease (such as Dpnf) can only 
cleave the methylated silc and so activate the oligonucleotide for extension. If a mismatch is 
located 5' to the cleavage site, significant nonspecific amplification requires the serial coupling 
of mismatch cleavage and a misincorporation, which is a rare event. Activatable 
oligonucleotides may also be combined with "rrnnisequencing" primer extension. This may 
provide a more specific assay for detection of single base changes that might be particularly 
amenable to chip technology in which specificity can be a problem (Syvanen, 1999). 
Demonstration that PAP can occur in the linear format (Fig. 10) supports the feasibility of this 
approach. 

[0219] Nucleoside triphosphates and 2'-deoxyoucieoside triphosphates or their chemically 
modified versions may be used as substrates for multiplc-nucleotide extension by PAP, i.e., 
when one nucleotide is incorporated the extending strand can he further extended, 2',3'- 
didcoxynucleosidc triphosphates or their chemically modified versions that are terminators for 
further extension may be used for single-nucleotide extension. 2' p 3'-dideoxyjxucleoside 
triphosphates may be labeled with radioactivity or fluorescence dye for differentiation from the 
3' terminal dideoxynucleotide of oligonucleotide P*. Mixtures of nucleoside triphosphates or 2'- 
deoxynucleotide triphosphates and 2 , ,3 , -dideoxynucleoside triphosphates may also be used, 
[0220] In PAP, specific nucleic acid sequence h produced by using the nucleic acid containing 
that sequence as a template. If the nucleic acid contains two strands, it is necessary to separate 
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the strands of the nucleic acid before it can be used as the template, either as a sqparate step or 
simultaneously. The strand separation can also be accomplished by any other suitable method 
including physical, chemical or enzymatic means. 

|022I] When it is desired to produce more than one specific product from the original nucleic 
s acid or mixture of nucleic acids, the appropriate number of different oligonucleotides are 
utilized. For example, if two different specific products are to be produced exponential ly, four 
oligonucleotides are utilized, Two of the oligonucleotides ( P+£l ) are specific for one of the 
specific nucleic acid sequences and the other two oligonucleotides ( P*£I) are specific for the 
second specific nucleic acid sequence. In this manner^ each of the two different specific 
10 sequences can be produced exponentially by the present process. 

[0222) The DNA or RNA may be single- or double-stranded, may be a relatively pure species or 
a component of a mixture of nucleic acids, and may be linear or circular. The nucleic acid or 
acids may be obtained from any source, for example, from plasmid, from cloned DNA or RNA, 
or from natural DNA or RNA from any source, including bacteria, yeast, viruses, and higher 

is organisms such as plants or animals. DNA or RNA may be extracted from blood, tissue materia* 
such as chorionic villi or amniotic cells by a variety of techniques such as that described by 
Maniatis et al. (19S2). 

[0223) The P+ oligonucleotides arc selected to be "substantially 1 ' complementary" to the 
different strands of each specific sequence to be amplified. Therefore, the P* oligonucleotide 

20 sequence need not reflect the exact sequence of the template. For example, a 
non-complementary nucleotide segment may he attached to the 5 T -end of the P" oligonucleotide, 
with the remainder of the P* oligonucleotide sequence being complementary to the strand. 
Alternatively, non-complementary bases or longer sequences can be interspersed into the P+ 
oligonucleotide, provided that the P* oligonucleotide sequence has sufficient complementarity 

25 with the sequence of the strand to he amplified to hybridize therewith and form a template for 
synthesis of the extension prodnct of the other P* oligonucleotide. As Used in the claims, the 
term "complementary" should be understood to mean "substantially complementary," as 
discussed herein, 

[02241 -toy specific nucleic acid sequence can be produced by the present process. It is only 
3 o necessary that a sufficient number of bases at both ends of the sequence be known in sufficient 
detail so that two oligonucleotides can hybridize to different strands of the desired sequence at 
relative positions along the sequence, The greater the knowledge about the bases at both ends of 
the sequence, the greater can be the specificity of the oligonucleotides for the target nucleic acid 
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sequence, and thus the greater the efficiency of the process. It will be understood that the word 
oligonucleotide as used hereinafter may refer to more than one oligonucleotide, particularly in 
the ciise where there is some ambiguity in th& information regarding the terminal sequence(s) of 
the segment to be amplified. 

[0225] The present invention can be performed in a step-wise fashion where after each step new 
reagents are added, or simultaneously, where all reagents are added al the initial step, or partially 
step-wise and partially simultaneous, where fresh reagent js added after a given number of steps. 
The simultaneous method may be utilized when an enzymatic means is used for the strand 
separation step. In the simultaneous procedure, the reaction mixture may contain the 
strand-separating enzyme (e.g M helicase), an appropriate energy source for the strand-separating 
tnfcymc, such as ATP. Additional materials may be added as necessary, 

[0226] The nucleic acid polymerase may be any compound or system that will function to 
accomplish the amplification. Suitable enzymes for this purpose include, for example, 'JJI DNA 
polymerase, Taq DNA polymerase, E. coll DNA polymerase I, Klenow fragment of E. coli DNA 
polymerase I, T4 DNA polymerase, T7 DNA polymerase, other available DNA polymerases, 
RNA polymerases or their variants, reverse transcriptase or its variants, and other genetic 
engineered versions, It is predicted on the basis of the relationship between reverse and forward 
reactions that a DNA polymerase will have high and even pyrophoaphoto lysis activity for the P* 
aclivatable oligonucleotide, if it incorporates tldNTFe efficiently (compared with dNTPs) and 
evenly (compared among the four ddNTPs). Of all the DNA polymerases, the genetic 
engineered version may be the best in the future, such as ThermoSequenase (Vander Hom et al., 
1397), Generally, the synthesis will be initiated at the 3' end of each oligonucleotide and proceed 
in the 5' direction on the template strand. However, inducing agents which initiate synthesis at 
the 5' end and proceed in die other direction can also be used in the PAP method as described 
above. 

EXAMPLE 2 
Preparation of template by PCR 
[02271 A 640-bp region of the human Di dopamine receptor gene was amplified by PCR with 
two primers (T - 5 1 GAC CTG CAG CAA GGG AGT GAG AAG 3' (SEQ ID NO:l) and U = S' 
TCA TAG CGG AAA GGG CTG GAG ATA 3' (SEQ ID NO:2)), The TU:UT duplexed product 
spans micleoridea 33 to 672 in GenBank X55760 and the G+C content of the product is 55% . A 
common A to G polymorphism is located at nucleotide 229, resulting in three genotypes of Q(Q t 
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A/A and G/A. The PCR volume is 50 ul: 50 mM KC1, 10 mM Tris/HCl, pH 8.3, L5 mM MgCIj, 
200 jiM each of the four dNTPs, 0.1 nM of each primer, 2% DMSO, 1 U of Taq DNA 
polymerase (Boehringer Mannheim) and 250 ng of genomic DNA from G/G homozygote, A/A 
homozygote or G/A heterozygotes. Cycling conditions included: demturation at 94°C for 15 
5 sec., annealing at 55*C fbT 30 sec, and elongation at 72 °C for one min., for a total of 35 cycles 
with a GeneAmp PCR System 9600 (Parkin-Elmer Applied Biosystems). The PCR product was 
purified from primers and other small molecules by approximately 1 0,000-fold by three times of 
retention on a Centricons LOO micmconcenttator (Amicon). The amount of recovered PCR 
product was determined by UV absotbance at 260 nm. 
10 Synthesis of P* by adding a 3' dideoxynucleotide 

[Q228J The deojcynlieleotide oligonucleotide was synthesized by Perseptive Biosystems S909 
Synthesiser (FraminBham) and purified by oJigopure cartridges (Hamilton) in the City of Hope 
DNA/RNA Chemistry Laboratory. The 3' icmunal didooxynucleottdc was added by termin*] 
transferase. The mixture contained a total volume of 30 uh 100 mM potassium cacodylate (pH 
is 7.2), 2.0 mM CoClj, 0,2 mM DTT, 2500 pM of the oligonucleotide, 2 mM 2\ 3-ddNTP (the 
molar ratio of the 3'-OH Senninus to ddNTP was l:24){Bcehringer Mannheim), 100 U of 
terminal transferase (GIB CO BRL). The reaction was incubated at 37°C for 4 hr and then 
stopped by adding EDTA at S mM final concentration. After desalting using a Centri-spin™ 
column (Princeton Separations), P* was purified by preparative 7 M urea/20% polyacrylamide 
20 gel electrophoresis in TBE buffer (90 mM Tris/borate, I mM EDTA, pH S.3) (Maniatis et a!., 
1982). The amount of the recovered P* was determined by UV absorbance at 260 nm. 
[02291 Since small amounts of untcrminatcd oligonucleotide would result in nanspecificity af 
pyrophosphorolysis, each P* was "P-labeled at the 5' terminus by T4 polynucleotide kinase and 
then was electrophorcsed torough a 7 M urea/20% polyacrylamide gel. Only P* products were 
25 visible even when the gel was overexposed. It is estimated that more than 99.99% of P* 
contained a dideoxynucleotide at the 3' terminus. The purity of P* was supported by the absence 
ofPCR product orPAP product at pH 8.3. 

Pyrophosphorolysis activated polymerization 
10230] Regions from 445 to 469 bp within the T(J:UT duplexed template were amplified by 
30 PAP with oligonucleotides P* and U, or with only P*. The PU:UP duplexed product 
corresponds to nucleotides 204-228 to 672 in GonJBank X55760 and its G+C content is 56%, 
The PAP reaction mixture contained a total volume of 25 ul: 50 mM KCl t 10 mM Tris/HCl (pH 
7.6), 1.5 mM MgCl 2 , 100 pM e*ch of the four dNTPs (dATP, dTTP f dGTP and dCTp), 0-1 uM 
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P*. 0.1 uM U chgonucleotide (TCArACCGGAAAGGGCTGGAGATA (SEQ Tt> NO:2», 300 
uM Ha^PPi, 2% DMSO, 1 ^Ci of [ct- 32 P] dCTP (3000Ci/mmoIe, Amersham), 1 U of 
Tunplira^-FS DNA polymerase (PE Applied Bfosystems) or 0.5 U of each of AitopliJa^FS and 
Taq DNA polymerases, and 10 ng of TU:UT. ThennoSequenase (Amersham Pharmacia) wag 
also tested under the same conditions except for 8U TnermoSequenase or 4U ThermoSequenase 
plus 0.5U Taq and 2.5mM MgCl 2 . The cycling conditions included: denaturation at 94 B C for 10 
sec, annealing at oO^C for i miu, {at 55 °C for ThermoSequenase), and elongation at 72 6 C for 2 
min., for a total of 15 cycles. 

10231] The product was elecrrophoresed through a standard 2% agarose gel. The gel was stained 
with ethidiimi bromide for UV photography by a CCD camera (Bio-Rad Gel Doc 1000) arid 
Multi -Analyst* software, dried and subjected to Kodak X-OMAT™ AR film for 
autoradiography. The PAP yield was quanrjtated with a Phosphorlmager with IniageQuant 
software (Molecular Dynamics) as the total number of pixels in the PGR band minus the 
background, indicated as a random unit. 

Enhanced PAP efficiency 
[0232] In Example l f only the P* with ddG at the 3' terminus was amplified using native 2JI or 
Taq DNA polymerase Amplira^FS and ThermoSequenase DNA polymerasea were found to 
achieve much higher PAP efficiency with much less discrimination against any kind of 
dideoxynucleotidc (ddAMP, ddTMP, ddGMP or ddCMP) at the 3' terminus of P*. For example, 
P*(2l2)l&G° and P*{2l2)laA <3 1 which are l&-rner& of the dopamine D| receptor gene but have 
ddGMP and ddAMP at the 3' termini (Table 3), specifically amplified the G and A alleles, 
respectively, Their yield ratio was 1,4 (compare lanes 9 with 11 in Fig. 11B), and bo 
P*(212}I8G D is estimated to he 4% more efficient per cycle than P*(212)18A°. Another 
P*(228)26A"" - 5' TAG G AACTTGGGGGGTGTCAGAGCCC* 3' (SEQ ID NG:12), which is 
a 26-mer with ddCMP at the 3' terminus, was amplified as efficiently as a primer without 
ddCMP at the 3' terminus, and the yield was estimated to be increased 1,000 fold compared with 
that by using Tfi or Taq, Moreover, PAP amplified segments directly from human genomic 
DNA. 
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[0233] AmphTafFS has two mutations compared with native Tag. One mutation in the 5' 
nuclease domain eliminates 5'-3' exonncleafie activity and the second mutation in the 

active site (Inrus and Gelfand, 1999). ThermoSequenase has the same mutation F667Y in the 
active site hut a deletion of the 5-3' exonuclease domain {Tabor and Richardson, 1995; Van der 
5 Horn et al., 1997). They do not distinguish between dNTP and ddNTP for incorporation. The 
pyrophosphorolysis of ddNMPs, which is the reverse reaction, is supposed to be much higher 
and less discriminated by these enzymes. Although either AmphTdtfFS or ThermoSequenase 
DNA polymerases used was formulated to contain a thermostable pyrophosphatase 
(manufacturers' instructions) that can hydrolyse FP< in the reaction so as to decrease PAP 

id efficiency, PAP was still amplified under our conditions. AmpUJhgFS and Thermo Sequenase 
DNA polymerases will work better in their pure form without the contaminated 
pyrophosphatase. 

The 3' specific subsequence of P* 
[02341 Various P*s were examined with different lengths and miamatches using AmpliT^FS 

15 (Table 3). The effect of length and mismatch on PAP efficiency is expressed as the relative yield 
(%} between two P*a of different lengths from the same template (Fig, 12), which varied from 
0.0% to 201.5% with each two to four less bases in length. The specificity of PAP is also 
affected by P* length and mismatch (Table 3). The noise rate (%) is defined as the relative yield 
of the mismatch product to the match product and a specific signal is scored with <10% noise 

2d rate. . If the allele^specific base of P* was at the 3' terminus, only the specific allele Was 
amplified and the specificity was not associated with P* length (Fig. 1 2 A), tf the allele-specific 
base was not at the 3 1 terminus of P*. the specificity was associated with P* length. Any 
non-3'-terminal mismatch in the IS-mer P*, which was up to 15 bases from the 3' terminus, 
caused no amplification (Pigs, I2S-12E), but even two such mismatches in the 26-mer P* 

25 caused non-specific amplification. 

[0235] The 18-mers were further examined using "stacked" P*s, which span the allele-specific 
base at different positions (Fig. 13 and Table 4). The noise rate {%) varied from 0.0% to 7,1%. 
Tha length of the 3 1 specific subsequence was £ 13 bases. 
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TABLE 4 

PAP specific^ with differently positioned P*s 



P*(Z12)1SA-' 
P*(214)1SA* 



Nam* 
Template 



P*(2l2> 1SG U 



P*(218) 18G' 



r*"(2"2T>T8CT 
P'(224) 1SG" 



E(6E0lDNO:)- 



5 T GACTGACCCCTMtCCCTGCTT-GGAACTTGAGGGGTGTC 



5 'ACCCCTATrCCCTGCTTA- {34) 



5 'TTCCCTGCTTGGGAACT A (35) 



5 1 CCCTGCTTQGGflft.CTTGA*' p€3 



5 ' TGC1"TgGGAACTT:GAGGG* (37) 



Name 


3' terminal 
dtdcoxy 


Allete-speciflc 
but 


Tm(°q 


Noise rs«e (%)' 


Type 


DJst 
anct 


Exponential 
PAP 


Linear PAP template 


P*(Z12J18G U 


ddG 


G 


0 


56 


2.7 


"0.0 ' ' 


P*(212)18A n 


ddA 


A 


0 


54 


3.8 


1.1 


P*(214)18A" 


ddG 


A 


-2 


St 


4.7 


0,0 


P*(2lS)lSO-° 


ddT '" 




-6 


"54 


0.0 


0.0 


P+(22l)l&G' 


ddA 


G 


-9 


5d 


1.7" 


1.7 


P'(224)I8G- J 


ddG 


G 


-12 


26 


7-1 


0,6 



with one P*. Hit noise rate of PAP (%) is the relative yield of the non-specific allek product 1o 
the specific allele product. 

[0236] Similar results were obtained by using P*s which match and mismatch ihu G allele at 
different positions {Table 5). The noise rate with one mismatch was various from 0.8% to 5.6%. 
The Jength of the 3' specific subsequence was > 16 bases. The noise rate with two mismatches 
was 0% (compare lane 2 with lanes 10- 15 in Fig. 1 4). 
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[0237] Linear PAP was examined using only IS mer P*s and higher specificity was observed 
with lower noise rate (Tables 4 and 5). Linear PAP takca a different mechanistic pathway in 
which every non-specific product is generated from Ihe starting template which requires 
mismatched pyrophosphorolysis with the 3 r terminal mismatched P*, or both mismatched 
pyraphosphonjlysis and mismatched extension with the non-3' terminal mismatched P*. 
|0238] PASA was performed with 17-mer primers without adding a ddNMP at the 3' terminus 
(see Tables 4 and 5). A mismatched 17-mer primer strongly amplified a nonspecific product 
with 30% noise rate when the mismatch was as near as 6 bases to 3' termmus, showing a much 
shorter 3' specific subsequence. Similar results were reported elsewhere previously (Sarkar et a]., 
1990). 

[023d] In summary, f" (1-Iength) has two subsequences: a 3' specific subsequence (n « the 
number of bases of the 3" specific subsequence £1) determines the specificity, i.e.,within this 
region any mismatch to its complementary strand of the template results in no substantial 
amplification; and a 3' enhancer subsequence (m « the number of bases of 5' enhancer 
subsequence 0) enhances the amplification efficiency. PAP specificity is co-detemtiued by the 
base pairing specificity of the 3' specific subsequence, the pyrophosphonjlysia specificity and 
the polymerization specificity. Thus, the base pairing specificity of the 3' specific subsequence is 
a minimum requirement of the PAP specificity. 

[0240] The length of the 3' specific subsequence of P* may be affected by the sequence context 
and size of the P+, the type of the 3' termmal dideoxynucleotide, the template sequence, the 
DNA polymerase, other components like ion, and cycling conditions. When the template 
contains repealed sequences > 1 or homog* 040 " 3 polymer rune > 1, P* loses specificity for 
anchoring. The length of the 3' specific subsequence of P* may be affected by the sequence 
context and size of the P*. the type of the 3' terminal dideoxynucleotide, the template sequence, 
the DNA polymerase, other components like ion, and cycling conditions. When the template 
contains repeated sequences > 1 or homogeneous polymer runs > 1, P* loses specificity for 
anchoring > 

Scanning or Resentencing for Unknown Sequence Variants 
[0241] The property of the V specific subsequence of P* can be applied to scanning for 
unknown sequence variants or re-sequencing of predetermined sequences in a parallel way. Each 
nucleotide on the complementary strand of the predetermined a&quene* is queried by four 
downstream P*s, such as IS-mers (Pig- 1 1), which have identical sequence except that at the T 
terminus, either ddAMP, ddTMP, ddOMP or ddCMP corresponds to the wild-type sequence and 
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the three possible single base substitutions. The number of P*s scanning the complementary 
strand of X bases is multiplication of 4 and X, which is suitable for either exponential or linear 
PAP. The four downstream P*s can even be immobilized on a single dot when ddAMP, ddTMP, 
ddGMP and ddCMP at the 3' termini are labeled differently for diffenmtiation, such as by four 
5 fluorescence dyes. The amplification signal can thus be represented by intensity decrease of 
each dye when ddNMP is removed from P* by pyrophosphorolysis. One advantage of linear 
PAP is that the four ddNTPs can be used as substrates for single base extensions, with are 
labeled with different dyes for differentiatiou. 

{02421 Briefly, if only all the P*s corresponding the wild- type sequence are specifically 
10 amplified, ifie wild-type sequence can be arranged in order by analyzing overlaps. A P* with & 
single base substitution at the 3' terminus is amplified at the position of hemi- or homo-point 
mutations. The mutation also creates a "gap" Of no PAP signal, which spans a region of several 
successive nucleotides. For single haae substitution, the gap size (bases) + 1 = the length of the 
3' specific subsequence. 

is [0243] Furthermore, we can also scwi the sense strand by designing a second set of upstream 
P*s* An unknown single base substitution can he dBtexmined by combination of the two sets of 
P*s, even in heterozygotes. An unknown small deletion and insertion can be detected and 
localised. In order to identify a specific type of deletion or insertion, it is possible to add 
corresponding P*s. For fingerprinting, which can provide inibrmation of mutation position, 

20 there is a simple stacking way that the stacked region of each two successive P*s < the 3' 
specific subsequence on the array to reduce the number of P*s by up to n fold. 

Determination of de novo DNA sequence 
[0244] The concept of de novo DNA sequencing by PAP makes use of all the possible 3' 
specific subsequences of P* to identify the presence of the 3' specific subsequence in de novo 

25 sequence. A complete set of the 3' specific subsequences of P* is 4 n . Each of the 3 T specific 
subsequence has a complete subset of the 5' enhancer subsequence of 4 m . For example, a 
complete set of 16-mer as the 3" specific subsequence and 2-mer as the 5' enhancer subsequence 
can be indicated as (A, T, G> C)(a, T, G, C) N ia - 4 lB . 

[0245] Briefly, the procedure first determines the list of all the specific PAP amplifications and 
3 o then reconstructs the unknown DNA complementary sequence from this list by ordering the 3' 
specific subsequences with the given length by using the Watson-Crick pairing rules, 
[0246] The assembly process is intermpted wherever a given 3' specific subsequence of P* is 
encountered two or more times. One of the factors influencing the maximum sequencing length 
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is the length of the 3' specific subsequence. The length of a random sequence that can be 
reconstructed unambiguously by a complete set of the 3' specific subsequence with, the given 
length is approximately the square root of the number of the 3' specific sequence in the complete 
set with £50% possibility that any given 3' specific subsequence is not encountered two or more 
times. Oclamers of the 3" specific subsequence, of which there are 65,536, may be useful in the 
range up to 200 bases. Decanucleotides, of which there are more than a million, may analyze up 
to a kilobase tie novo sequence. 18 mer P*s containing 16 met as the 3" specific subsequence, 
which complete set is 4 ,s of P*s, may sequence maximum 77,332 bases. 
[02471 When there is neighbored known sequence to design an opposite oligonucleotide for 
PAP with two oligonucleotides. The maximum sequencing length is mainly limited to the 
opposite oligonucleotide, but not to the length of the 3' specific subsequence of P* termed 
Conditional de novo DNA sequencing. 

Other Applications for PAP 
102481 For fingerprinting which compares two DNA sequences to see if they are the same or 
different, there is a simple way to reduce the number of P*s by using an incomplete set of the 3 1 
specific subsequences. By arranging them in a particular order, it is possible to identify the 
chromosomal locations as well as sequences. Considering the 3 x 10 ? bp DNA in human 
genome, PAP with two oligonucleotides is preferred over PAP with only one P* to increase the 
specificity. 

[0249] To monitor gene expression profiling, where up to 6 x 10* to 10* transcripts are 
expressed and details of the precise sequence are unnecessary, PAP with only one P* can be 
applied and a set of P* which identify unique motifs in genes can be designed with a total length 
of up to 22- mer. Between each two P+s, there is at least a sequence difference at the 3" terminus 
or £ 2 sequence differences at the non-3' terminus. 

Comparison with Sequence by Hybridization 
[02 50] In SBH by using oligonucleotide, the DNA sequence is determined by the hybridization 
and assembly of positively hybridizing probes through overlapping portions. Ht has been known 
for a long time that a single oligonucleotide hybridization on a immobilized sample can be very 
specific in optimal hybridization and washing conditions [Wallace el al., 1979), thus it is 
possible to discriminate perfect hybrids from ones containing a single internal mismatch. The 
oligonucleotides in array are 11-20 nucleotides in length and have 7-9 bases specific region in 
the middle, the non-specific signal is generated by mismatched hybridization. Under standard 
hybridization and washing conditions, the duplex stability between match and mismatch is also 



53 

affected by the terminal mismatch and the flanking sequence (Drmanac et al,, 1989; Khrapko et 
al., 1939; Ginot, 1997). 

(023 1] SHB can be modified with enzynws in several ways {Miyada and Wallace, 1987; 
Southern, 1996). Primer extension by DNA polymerase incorporates bases one at a time only if 
5 they match the complement strand. Ligasc has similar requirements: two oligonucleotides can be 
joined enzymati tally provided they both are complementary to the template at the position of 
joining, 

[02S2] Figs. UA-11B show the enhancement of PAP efficiency. Fig. 11 A. PAP is amplified 
with two oligonucleotides P* and U from duplex TU:UT template. Each of the four P's has a 

io ddA, ddT, ddG and ddC at the 3" terminus. The 3' terminal base is either specific to the 
complementary strand of the G or A alleles, or not matched. Fig. UB. Autoradiogram of PAP 
from the G/G, A/A and G/A genotypes of the human dopamine receptor gene. The radioactively 
labeled specific products of 461 bases (duplex PU:UP and excess autisense strand UP) are 
produced. Other side products UT and UT:TU are indicated. Note that TU:TJT derives from 

is annealing of excess radioactively labeled UT with ncn-radioactively labeled TU original 
template. 

10253] Figs. 12A-12E show the effect of P* length and mismatch on PAP efficiency. PAP was 
amplified with P* and U oligonucleotide (see Table 3). In each of Figs. 12A-12E, P*s have the 
sample 3" termini but are different in length. Fig, 12A. In lanes 1-4, the P+s matched and 

20 amplified the G allele, in lanes 5-8 f the P+s mismatched at the 3' termini but amplified the A 
allele. Fig. 12B. In lanes 9-12, the P*s matched and amplified the G allele. In lanes 13-16, the 
P*s mismatched at -i2 buses to the 3' terroini but amplified the A allele. Fig. 12C, In lanes 
17-20, the P*a matched and amplified the A allele. In lanes 21-24, the P*s mismatched at -2 
bases to the 3' termini but amplified the G allele. Fig. 12D. hi lanes 25-28, the P*a mismatched 

25 at -9 bases to the 3' termini but amplified the A allele. Fig. 12B. In lanes 29-32, the P*s 
mismatched at -15 bases to the 3' termini but amplified the A allele. The length effect is 
indicated as the yield ratio in one lane (L„) to the previous lane (L^). The length effect was not 
shown in lanes 5-S because the signals are at or close to the background. 
[02541 Fig. 13 shows PAP specificity with differently positioned F*s. PAP was amplified with 

30 a P+ and U oligonucleotide (see Table 4). The P* matched to and amplified the G allele in lanes 
2-7, but rnismatcheid to and amplified the A allele In lanes 9-15. Lanes 1 and 9 were PGR. control 
with Di(212)17 mer and U. Lanes 3 and 16 were extension control with only U. 
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[0255] Fig. 14 shows PAP specificity with differently mismatched P*s. PAP was amplified 
with a P* and U oligonucleotide (see Table 5). In lanes 2-7, the P* amplified the G allele with 
match or one mismatch. In lanes 9-15, the P* amplified the A with one or two mismatches. 
Lanes 1 and 9 were PCR control with D|(212)l7 mer and U. Lanes S and 16 were extension 
control with only U. 



EXAMPLE 3 
PAP Amplification From Genomic DNA 
{0256] This example illustrates PAP amplification directly from genomic DNA, The 
oligonucleotides used in this example are listed below. Lane numbers refer to lanes m Fig. 15. 
[0257] The downstream oligonucleotides in 0. 1 hM concentration are: 

Lane 1 : D,(204)25D 5' TCTGACTGACCCCTATTCCCTGCTT 3' (SEQ ID NO:43) 

Lane2:P*(206)24A & 5' TGACTGACCCCTATTCCCTGCITA* 3' (A allele specific; 

SEQ ID NO: 44} 

Lane 3: P*(204)2fiG° 5 1 TCTGACTGACCCCTATTCCCTGClTG* 3' (O allele 

specific; SEQ ID NO:45) 
Lane 4: P*{206)24G" 2 5' ACTGACCCCTATTCCCTGCTTGGG* 3' (G allele specific; 

SEQ ID NO:46) 

Lanc5:P*(228)26A" S 1 TAGGAACTTGGGGGGTGTCAGAGCCC * 3' (A allele 

specific; SEQ ID NO:47) 
[0258] The opposite upstream oligonucleotide in 0.1 yiM concentration is: D t (420)24U 
5' ACGGCAGCACAGACCAGCGTGTTC 3 1 (SEQ ID NO:48), which was paired with each 
downstream oligonucleotide. See Footnotes of Table 3 for details. 

[0259) The other components were the same as in Example 2, except for the following: 0.5 U of 
each of AmpliTo^FS and Taq DNA polymerases, and 100 ng of heterozygous G/A allelic 
genomic DNA were used per 25 u 1 reaction by using 30 cycles, 

10260] The PAP product size range from I93bp to 218 bp. One double stranded and one single 
stranded product was observed on the gel, indicating the exhaust of PP{ hydrolyzed by the 
contaminated thermostable pyrophosphatase. 
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EXAMPLE 4 
Comparison of Specificity of LM-PCR and LM-PAP 
10261] The LM-PCR protocol includes primer extension, linker ligation. PCR amplification, and 
directed labeling in the human dopamine Dj receptor gene model system (Fig. Ifj). LM-PCR 
was performed with the addition by terminal deoxynucleotidyl transferase (TdT) (this protocol is 
known as TD-PCR) on UV-treated genomic DNA samples essentially as described {Ffeifer et 
al., 1999), except that Vent* (exo-) DNA polymerase was used in the first 10 cycles of primer 
extension (PI primer - 5' TTGC C AC TC AAGC GGT CCTCT CAT J {SEQ ID NO:49)). 
Temperature cycles were 1 min at 95° C t 3 min. at 63° C, and 3 min at 72° C. To enhance the 
signal, terminal transferase was added to the protocol, and this variation of LM-PCR is called 
TD-PCR. Dynabeflds were used to eiyich target DNA molecules before terminal 
deoxynucleotidyl transferase {TdT) tailing. PCR was performed using Expand Long Template 
PCR System 3 (BMB) as described hy the manufacturer (P2 primer = 5 T GAAGCAATCTGGCT 
GTGCAAAGTC 3' (SEQ ED NO;50)>. The PCR products were purified using QlAquiek JPCR 
Purification Kit (QIAGEN) before performing the direct labeling. A portion of the cleaned PCR 
product was used for direct labeling wirn AmpllTaq DNA Polymerase (Perkin-Elmer) with 3, P 
labeled primers: 

P3A: (5' TCTGACTGACC CCTATTC CCTGCTT A 3 T (SEQ ED NO:5l; the y terminal 

deoxynucleolide is A allele specific) Bud 
P3G: (5' TCTGACTGACCCCTAITCCCTGCTTG 3" (SEQ ED NO:52; the 3' terminal 

deaxynucleotidc is G allele specific). 
[0262] LM-PAP was performed as allele-specific PCR except for ihe direct labeling step by 
PAP (Fig. 16A). The purified PCR product was used for direct labeling with 32 P labeled primers: 
P3A: 5' TCTGACTGACCCCTATTCCCTGCTTA* 3 T (SEQ B) NOr53; the 3 T terminal 

deoxynucleotide is A allele specific) and 
P3G: 5' TCTGACTGACCCCTATTCCCTGCTTG* V (SEQ ID NO:54; the 3' ternunaS 

deoxynucleotide is G allele specific) 
using PAP reaction conditions in a 10 \\\ volume (50 mM KCI, 10 mM Tris/HCl (pH 7.6),L5 
mM MgCU, 100 |*M of each dNTP, 0.1 uM P*, 300 jjM Na^PPj, 2% DMSO, 0.25U each of 
AmpIilhgFS and AmpliJa? DNA Polymerases (Perkin-Elmer). The cycling conditions were 
94° C, 10 sec; 60° C, 1 rnin. and 72" C, 2 min. for a total of S or 16 cycles. LM-PAP was 
dramatically more specific than LM-PCR. The initial data with the dopamine Dl gone shows a 
lower background with LM-PAP than with the identical unblocked oligonucleotide with LM- 
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PCR. Also, LM-PAP can be performed with the PGK gene, a gene with, a very high GC rich 
region (70%) (Fig. l&B). 

[0263] Fig, I6A shows a UV footprintmg of the dopamine Dl receptor gene with a comparison 
of allele- specific LM-PAP and aUele-specific LM-PCR. A direct comparison of LM-PAP with a 
P* and LM-PCR with an unblocked primer of identical sequence shows that two alleles can be 
distinguished with LM-PAP, but not with LM-PCR, Both methods were performed on HF-IG 
DNA that was untreated (Q } in \>itro treated (T) or in vivo treated (V) with UV. The direct 
labeling reaction using PAP conditions (lanes 7-18) with "P labeled primers P3A* (lanes 7-9 
and L 3- 15) and P3G+ (lanes 10-12 and 16-18) was done with AmpliTaqFS and AmpliTaq for S 
and 16 cycles, For LM-PCR the direct labeling reaction was done with AmpliTaq (lanes \-6) 
and 32 P-labeied primers P3A (lanes 1-3) and P3G (lanes 4-6) for S cycles. Allelic primers P*s 5 
P3 A* and P3G* for LM-PAP clearly distinguish the two alleles, while unblocked allelic primers 
of identical sequence, P3A and P3G, were unable to distinguish the alleles by LM-PCR. 
|0264| Fig. 168 shows a UV footprinting of the pgK gene. The LM-PAP procedure for PGK 
was essentially the same as for the dopamine Dl receptor except that Pfu Turbo DNA 
poiymerase was used in the primer extension, as well as 7-deaza-dGTP/dGTP in a 3; I ratio. 
Temperature cycles were 95° 1 mm., 60° 2 mm., and 76° 3 min. The PCR step was performed 
using Vent (exo-) DNA Polymerase at 97° 1 min.> 60° 2 min., 76° 3 min. also with deaza dGTP. 
The purified PCR products were used for direct labeling with the 32 P P3G* and P3C* primers 
using PAP reaction conditions in a 25 ul volume (50 mM KCL, 20 mM Hepes, pH 6.95, 10 mM 
(NRikSO* 1.5 mMMgCb., 40 uM dNTP, 150 uM Na*PPi, 4%DMSO, and 1 unit of AmpliTaq 
FS DNA Polymerase, The conditions for cycling were 94" 15 see,, 60° 30 sec., and 72° 1 min. 
for 10 cycles. 

EXAMPLE 5 

Optimization of PAP- A to Detect a Mutation in 1 of 1G 4 -1 0 s Templates 
[0265] One ug of lambda phage DNA contains 2 x 10 ID copies of template. The specificity of 
PAP is determined by mixing one part mutant lacJ templates with 10* to 10* parts control DNA 
templates, e,g,, wild-type lad. The specificity of PAP-A is a function of the error rate of the 
polymerase, the purity of P* (<2 x 10" 4 by current purification protocol) and The potential for 
damage of the DNA template in the extraction process. The yield and specificity of PAP is 
optimized by testing enzyme type and concentration and the concentrations of other 
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components, such as dNTP, PP|, Mg** or Mrt**. Hotstart PAP using antibody-activated enzyme, 
such as DNA polymerase, at room temperature can be used to eliminate spurious amplifications. 
[0266J Wild- type and mutant lambda phage DNA, which are used in the laboratory as a model 
system to study spontaneous mutation in mammals, are prepared from infected E. colt SCS-S 
s cells (Nishino et al, 1 996). The lambda phage is grown under high fidelity conditions and DNA 
is isolated with care under conditions, with low rates of DNA damage (StTatagene manual) 
(Nishino et al, 1996; Hill et al., 1999). 

[0267] The mutants include one example of each of the two types of tranai lions, the four types 
of trans versions and a one-base nucleotide deletion, P*s specific for each of the mutations is 

10 synthesized. These DNA templates are used for reconstruction experiments in which mutated 
DNA is serially diluted into wild-type DNA. The spiked samples are used to optimize PAP-A. 
The most robust polymerases are chosen based on yield and specificity using TaqFS. 
Thermo Sequenase, and SequiTherm Excel H (Epicentre). Other components of the reaction are 
optimized systematically, including thermccyclmg parameters, oligonucleotide length* and 

is reagent concentrations of PP r , dNTP and Mg* 4 or Mn*\ Quantitative detection of the yield of 
PAP product is achieved with autoradiography or fluorescence on a SSCP gel. These data aids in 
the optimization of PAP-R and LM-PAP (below). The optimization of these various parameters 
result in a specificity of 1 part in lOMo*. 

(02681 The optimised conditions are also tested fcr detecting mutations in the human factor IX 
20 gene by mixing human mutant genomic DNA templates With Up to 10* wild-rype templates. As 
with the lambda experiment, exponential PAP is performed with appropriately designed 
oligonucleotides (using OligoS software) tor 40 cycles and atrong signal is achieved by 
autoradiography or by fluorescence detection. 

25 EXAMPLE 6 

Optimization of PAP-R 

[03691 In a model system, mismatches along the length of F* inhibits activation, even when 
the mismatch is two nucleotides from the 5' end (Fig. 14). An additional set of IB rncrs of P*s, 
whose 5' termini were displaced 2, 6, 9, and 12 nucleotides downstream, also showed inhibition 
30 of activation (Fig. 13). In addition, 20 and 22 mers also show inhibition with single nucleotide 
mismatches (Fig. 12). To extend these findings and to lay the foundation for a robust method of 
rescquencing, the relationship between the location of single base mismatches and activation of 
P*sis analyzed further. 
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[0270] The factor DC gene is used as a model system because more than 1,000 DNA samples 
from hemophilia patients and family members have been ascertained from previous work on the 
molecular epidemiology of germline mutations in humans (Sommfcr, 1995; Ketterling et aL, 
1999). Two 20-nuclcotide regions of exon B and exon H in the human factor FX gene are used 
as model systems. The region of exon B is designed from nucleotides 6460 to 6479 (5' 
CGAGA AGTTTTTG AAAAC AC 3' (SEQ ID NQ:55;Yo3hitakc et aL, 19S5), within which eight 
different single base mutations are available. The region of exon H is from nucleotides 30845 to 
30S64 (5 T OAACATACAGAGCAAAAGCG 3' (SEQ ID NO:56), within which seven mutations 
at different positions are available. P*s identical to wild-type regions B and H will be 
synthesized. Identical P*s are synthesized, with the exception of a single nucleotide mismatch. 

The wild-type factor DX sequence is used in the initial studies. A few p*s that match the 
wild-type sequence or that mismatch at selected sites within the 5' third of the oligonucleotide 
sequence are helpful in performing pilot experiments to assess the optimal length of the 
oligonucleotide. The effects of polymerases and reaction conditions can be assessed. 
[UZ72J From preliminary data, it appears that IS mers or larger may be an optimal size. It is 
also possible that 2$ mers or even 30 mers may be optimal. For the present example, it is 
assumed that 20 mers are an optimal size. Wild type P* and twenty P*s with one of the possible 
single base mismatches at each nucleotide of the position region of exon B are synthesized. 
Eight of these P* are a perfect match to a mutation in a patient with hemophilia B* As positive 
controls, it is shown that these P*s activate efficiently when the appropriate mutated DNA 
sample is used. Exponential PAP and linear PAP are performed and the noise rate is determined. 
The noise: rate for linear PAP is generally lower tod is used, 

[0273] To confirm preliminary data in another sequence context, a similar experiment is 
performed in exon H. The seven mutations in that region of exon H are analyzed in a blinded 
manner to determine if the precise match is detected. The effects of the position of the mismatch 
or the type of mismatch on P* activation is determined. The effects of different polymerase^ 
reaction temperature, and other reaction conditions can also be determined. Another set of 20 
P*s provides additional data from mismatches 11-20 nucleotides from the 3' terminus. 

EXAMPLES 
Optimization of LM-PAP 
[02741 The human dopamine D| receptor gene and the mouse Pgkl gene are used as model 
systems lo compare the analysis of chromatin structure when LM-PAP or LM-PCR is utilized. 
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The dopamine D E receptor gene has been described above. X chromosome inactjvation occurs at 
an early embryonic stage. Since the two alleles in female cells maintain a different expression 
status, this is an advantageous system for studies of gone regulation. Pgkl is an X-lraked 
housekeeping gene encoding phosphoglycarate kinase (PGK), PGK is an important enzyme in 
5 glycolysis and the gene is expected to he active all the time except in the inactive X 
chromosome (Xi) of female somatic cells and in male germ cells. 

[0275] The preliminary data shows a dramatic enhancement of specificity with LM-PAP relative 
to LM-PCR in the dopamine Di receptor gene, a gene not previously analyzed for chromatin 
structure (Fig. 16A). In this example, LM-PAP and LM-PCR are performed. Three sets of 

to oligonucleotides that generated LM-PCR profiles and seven sets of primers that generated LM- 
PCR profiles with unacceptable background in the Pgki (and other X -chromosomal genes) are 
used to compare LM-PAP with LM-PCR. D coxy -terminating and dideoxy-terminating 
oligonucleotides of identical sequence are utilized lo perform LM-PAP and LM-PCR, 
respectively. The level of signal relative tc background is also quantitated by a Phospholmager. 

is The average si glial- to-noise ratio is determined. Optimisation data derived from analyses with 
PAP-A and PAP-R are also useful in the LM-PAP protocol. LM-PAP is optimized for the two 
regions to determine if the signal-to-noise ratio can be reduced further. 

EXAMPLE 8 

s 0 Optimization of Allele-S pacific LM-PAP 

[0276} Polymorphic sites of pgkla and lb gene in both coding and non-coding regions have 
been reported (Boor et ft]., 1990), These are Used to design the allelo-specific P*. One ailele- 
specific oligonucleotide is chosen prospectively from the Pgkl gene and one is chosen 
prospectively from the dopamine Di receptor gene. Blocked and unblocked oligonucleotides of 

25 identical sequence are synthesized and allele-specific LM-PAP and LM-PCR are performed, 
respectively. The signal to noise ratio is quantitated and compared. 

EXAMPLE 9 
PAP-R cn a Microarray 

3 0 [0277] The initial experiment will fbcus on the two 20 nucleotide regions of exons B and H as 
described above. The experimental design of PAP-R is similar to the experiments described 
above, except for digital light-direct synthesis ofP* oligonucleotides on amicroairay, e.g., with 
the Geniom® instrument. A total of 160 oligonucleotides are synthesized complementary to 
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wild-type and to all the single base mismatches for 20 bp regions of axons B and H of the factor 
IX gene. Aa a positive control, 160 oligonucleotide, each out of registered by one nucleotide, 
are synthesized to match exactly an adjacent 160 bp region of the factor IX gene. Genomic DNA 
from wild -type and mutant samples is amplified, annealed to the oligonucleotides and primer 
extension will be performed with a fluorescent dideoxy terminator. The protocol is optimised for 
the solid support. Adjustment of primer length, enzyme utilized and reaction conditions is 
performed such that most, if not all,, of the oligonucleotides that mismatch the two 20 bp 
nucleotide regions of factor IX generate little if any signal* while most of the 160 control 
oligonucleotides generate a strong signal. 

{0278J One strategy for re&equencing is shown in Figs. 3 and 4. Bach nucleotide in the 
complementary strand of the predetennined sequence is queried by four downstream P*s, such 
as 20 mers, which have identical sequence except for the 3 P terminus, which is either ddA, ddT, 
ddG or ddC. For a i itb segment, 4,000 P*s are needed in the dowustream direction. In the 
second set of experiments, exons B and H of the factor DC gene are resequenced. Samples from 
more than 200 patients with different mutations in these regions are available for analysis. False 
positives and false negatives are assessed by blinded analysis. Heterozygous female samples are 
available for many of the mutations. For the remaining male patient samples one to one mixing 
experiments with wild- type or a second mutated sample generates the equivalent of 
heterozygotes or compound heteTozygotes, respectively. Subsequently, all the regions of likely 
functional significance (the putative promoter region, the coding regions, and the splice 
junctions) are resequenced {2.2 kh). Since mere Iban 600 independent mutations are available, it 
is possible to determine whether more than 99*A of all sequence changes are identified (the 
sequence changes in these samples have been determined by direct sequencing over the course 
of a decade). 

J0279] A ?* with a single base substitution at the 3' tenrrinus generates a signal at the position 
of hemizygous or homozygous point mutations The mutation also creates a "gap" of no PAP 
signal, which spans a region of several successive nucleotides. When a single base substitution 
occurs, the gap size (nucleotides) + 1 ■ the length of the 3' specific subsequence (Figs. 3 and 4). 
[0280J To analyze samples with higher G+C content (55%), mutations in the lad gene are 
utilized. These mutations from the Big Blue® Transgenic Mouse Mutation Detection System, 
have the potential to facilitate the definition of a strategy that detects more than 99.9% of 
mutations, since more than 6,000 mutations are available in this system. The relevant regions are 
analyzed with, the help of robotic devices. In addition, hundreds of mutations or polymorphisms 
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are available for analysis in other genes with G+C contents of 30-75%. The dystrophin gene is 
particularly amenable to testing performance under conditions in which megabases of sequence 
require scanning. In this gene in which 90 segments aTe amplified by a robotic device, virtually 
all sequence variants have been denned by DQVAM-S followed by DNA sequencing. This is 
s advantageous because many molecular epidemiological and molecular diagnostic applications 
benefit from resequencmg that detects virtually 100% of the mutations. 

EXAMPLE 10 

PAP Amplification Directly from Human and Mouse Genomic DNAs 

10 10281) PAP was performed with each of two P"b, PI* (SEQ ID NO:45, G allele specific)) or 
P2* (SEQ ID NO:47, A allele specific) and an upstream unblocked primer (U; (SEQ ID NO:48) 
to amplify 180-bp segments of the Di dopamine gene. The P* are 26-mers with ddC and ddG at 
the 3* termini. 100 ng of human genomic DNA was amplified for 35 cycles followed by 2% gel 
electrophoresis. The PAP reaction mixture contained a total volume of 25 u.1: 50 mM KC1, 20 

is mM HEPES/NaOH (pH 6.9 at 25 B C), 10 mM (NH,) Z S04, 1.5 mM MgCl 2 , 40 uM each of the 
four dNTPs (cATP, dTTP, dGTP, dCTP), 0.1 uM U, 150 uM Na^Pi, 2% DMSO, 0-5 U of 
AmpliTbgFS polymerase (PE Applied Eiosystems), 0.5 U Taq polymerase and 100 ng of human 
genomic DNA. The cycling conditions were 94 °C for 15 sec, 65 °C for 30 sec and 72 "C for 1 
mirL Fig. 17A shows the results for PAP amplification of the Dj dopamine gene. In lanes 2 and 

20 5, PI* is specific for the A allele template at 24 nucleotides from the 3' terminus, so there is 
little or no discrimination between die G/G and A/A genotypes. In lanes 3 and 6, P2* is specific 
for the A allele template at 2 nucleotides from the 3* terminus, so there is specific amplification 
of the A/A genotype. Lanes 1 and 5 are PGR controls. Lanes 4 and 8 are negative controls 
without P*. Lane M is 120 ng cpx DNA/HAETV marker. 

25 102S2] Three Bi-PAP assays were tested directly from mouse genomic DNA. Bi-PAP was 
performed with two P*s containing a dideoxynucleotide blocker at the 3' terminus to amplify an 
80-bp segment of the lad gene. The P*s are specific to the wild-type template and are 40-42 
nucleotides long. In each of the three Bi-PAP assays, two opposite P* with one nucleotide 
overlap at their V termini were used to amplify 400 copies of the lad gene using 35 cycles.. 

3 o The sequences of the P*s are as follows; 

5' GAAGCGGCGTCGAAGCCTGTAAAGCGGCGGTGCACAATCT* 3» (SEQ ID 
NO:67) and 5' GCGG ATAGTTAATGATCAGCCCACTGAC GCGTTGCGCGAGA A 4 3' 
{SEQ ID NO: 63) in lanes I and 2; 



WO D3/095*64 PCT/TJStt3/ 14556 

62 

5* GATGGCGGAGCTGAATTACATTCCCAACCGCGTGGCACAA* 3' (SEQ ED 
NO;69 and 5' GGCAACGCCAATCAGCAACGACTGTTTGCCCGCGAGTTGT+ (SEQ ED 
NO:70) in lanes 3 and 4; and 

5' TACATTCCCAACCGCGTGGCAGAACAACTGGCGGGCAAAC* 3* (SEQ ID 
NO:7I) and 5' GGGCCAGACTGGAGGTGGCAACGCCAATCAGCAACGACTG* 3' (SEQ 
ID NO: 72) to lanes 5 and 6. 

10183] The PAP reaction mixture contained a total volume of 25 ul: 50 mM KCK 20 mM 
HEPES/NaOH (pH 6.9 at 25 a C) f 10 mM (NILiJjSO^ 1.5 mM MgCl 2l 40 uM each of the four 
dNTPs (dATP, dTTP, dGTP, dCTP), 0.1 uM U, 150 uM Najpi, 4% DMSO, 1.0 U of 
AmphTatjFS polymerase (PE Applied Eiosystems) and 400 copies of mouse genomic DNA. 
The cycling conditions were 94 °C for 15 sec, 65 °C for 30 sec and 72 n C for 1 min. The 
unincorporated P*s were separated well from the Bi-PAP product on 2% agarose geL No dirner 
was seen. Fig. 17B shown ihe results for these Bi-FAP assays. In lanes I, 3 and 5, the wild-type 
templates are amplified. Lanes 2, 4 and 6 are negative controls without mouse genomic DNA. 
[02841 Three PAP assays directly amplified ISO-bp segments of thp Dl receptor gene from 
human genomic DNA with strong signals of PAP products. The allele-specificity of 2<S-mer P* 
remains when the mismatch is at 2 nucleotides from the 3* tcrjiutrus, but the allele-specificity is 
lost when the mismatch is at 24 nucleotides from the 3' terminus. Three Bi-PAP assays directly 
amplified as low as four hundred copies of the iacl gene from mouse genomic DNA. The P+ 
oligonucleotides have different deoxynucleotides blocked at the 3* terminus and all can he 
efficiently activated. Addition of extra human DNA did not affect the amplification of the lad 
gene in mouse genomic DNA, The product of Bi-PAP was easily distinguished from 
unincorporated P* s . P* does not form dimmers because P+ needs long and perfectly matched 
regions at the 3' terminus for activation. 

EXAMPLE 11 
PAP with Acyclonucleotiies and Various Polymerases 
X Phage DNA Template 
[0285] The wild-type X. phage DNA template that contains an inserted wild-type lad gene of £. 
cpli (Kohler et al. P 1991) was purchased from Stratagene. The mutant X phage DNA template 
was prepared from X phage plaques transformed into SCS-8 E. coli cells ^ccordins to Maniatis, 
et al. (1982). It contained a T to G mutation at nucleotide 369 in the /n^gene. The amount of X 
phage DNA was determined by UV absorbance at 2fi0 cm. 
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Synthesis of P* by Adding Acyclonucleotide or a Dideoxynucleotide at the 3' Terminus 
[0286] The 3' terminal acyclonucleotide or 3' terminal dideoxynucleotide was: added to a 
deoxynuclootide oligonucleotide by terminal transferase. The mixture contained a total volume 
of 25 ul: 100 mM potassium cacodylate {pH 7.2), 2.0 mM CoCl 2l 0.2 mM DTT, 2 nM of the 
$ oligonucleotide, 2.4 mM acycloNTP (the molar ratio of the 3'-OK terminus to acycloNTP was 
1:30} (New England BioLabs), or 2.4 mM 2\3'-ddNTP (the molar ratio of the 3'-OH terminus to 
ddNTP was J :30)(Roche), 100 U Of terminal transferase (Invitrogcn). The reaction was 
incubated at 37 °C for 6 hr and then stopped by adding EDTA to a 5 mM final concentration. 
After desalting using a Centri-spin^ column (Princeton Separations), P+ was purified by 
io preparative 7 M urea/1 8% polyacrylamide gel electrophoresis with 30 mM 
triethRnolamine/tricine buffer (pH 7.9 at 25 P C) (Maniatis, et al., 1982-, Liu, et al.. 1999b). The 
amount of recovered P* was determined by UV absorbanoe at 260 run. 

[02871 Since small amounts of unterminated oligonucleotide would result in unexpected PCR 
amplification, the purity of P* was tested by the absence of PCR product at pH 8.3 in which 
is pyrophosphoxolysis ia inhibited. It is estimated that more than 99.99% of P* contained an 
acyeioiiwileotide or a dideoxynucleotide at the 3" terminus. 
PAP Amplification 

[02881 PAP was examined with P*l and 01 , with P*2 and 02, and with P"l and PZ* 
respectively (Fig. 18A and Tabic 6). The P*s were 30 or 35 nucleotide long and contained an 
20 acyclonucleotide or a didwjxyuMcleolidt at the 3' terminus. 
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[0289] The PAP reaction mixture with Ampli TaqFS DNA polymerase contained a total volume 
of 25 50 mM KC1, 20 mM HEPES/NaOH (pH 6.9 at 25 °C), 10 mM <NH*) a SO<, 1.5 mM 
MgCl 2 , SO uM each of the four dNTPs (dATP, dTTP. dGTP and dCTP), 0.1 uM of each 
oligonucleotide, 150 uM NsuPPi, 4% DMSO, 1 U of Ampli7b:?FS DNA polymerase (PE 
Applied Biosystems), 0,1 ng of the X phagB DNA template. The cycling conditions were 92 °C 
for 10 sec, 65 °C for 30 sec, and 72 a C for 1 min for a total of 30 cycles. A denaturing step of 92 
P C foe 1 min was added before the first cycle. 

[0290] The PAP reaction mixture with Vent (exo-) or Pfu (exo-) contained a total volume of 
25 M l: 10 mM KCL 20 mM HEPES/NaOH (pH 7,19 at 25 *C), 10 mM (NH^SOj, 1.2 mM 
MgCl 2 , 50 uM each of the four dNTPs (dATP, dTTP, dGTP and dCTP), 0.1 uM of each 
oligonucleotide, 150 uM Na^PP;, 4% DMSO, 1 U of Vent (exo-) DNA polymerase (New 
England BioLabs) or Pfii (exo-) DNA polymerase (Stratagene), 0.1 ng of the X phage DNA 
template. The cycling conditions were 94 °G for If sec, 60 °C for 30 sec, and 72 °C for 1 min 
for a total of 30 cycles, A denaturing step of 94 °C for 1 rain was added before the first cycle. 
10291] The product was electiophoresed through a standard 2% agarose gel. The gel was stained 
with ethidium bromide for UV photography by a CCD camera (Bic-Rad Gel Doc 1000). 
f0292] As shown above, TaqFS, a genetically engineered DNA polymerase (Innis and Gelfand, 
1999), greatly improved the efficiency of PAP. 3" terminal dideoxymicleotide blocked P*s can 
be activated by pyrophosphorolyeis to remove the 3 T terminal dideoxynucleotide in the presence 
of pyrophosphate (PPi) and the complementary atrand of the allelic template. Then the activated 
P* can. be extended by DNA polymerization. 

[0293 J PAP was performed with 3" acyclonucleotide blocked P*s by using X phage DNA 
containing the lad gene as model system. P*l end P*2 are downstream and upstream blocked 
oligonucleotides, respectively, for the same mutation (Fig. ISA and Table 6). The P*l and P*2 
have an acycloGMP and acycloCMP at their 3' termini, respectively. Amplification products 
were absent without pyrophosphate added at pH 8.3 where pyrophoephorolysis is inhibited, 
showing that P* I and P*2 were not directly extendible. 

[0294] P* I and P"2 are specific to the mutated template but mismatch to the wild-type template 
at their 3* termini. The mutated template was amplified efficiently by PAP with one 
acyclonucleotide blocked P* and one opposing unblocked oligonucleotide and by PAP with two 
opposing 3' terminal acyclonucleotide blocked P*s (lanes 1 and 2 in Fig. I SB), with two 
opposing acyclonucleotide blocked P*s (a special form of PAP where the two opposing P*s are 
overlapped at their 3' termini by one nucleotide) (Laud 3 in Fig. J SB). However, no product was 
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generated from the wild-type template because of the mismatch at the 3' terminus, showing the 
specificity (lanes 5-7 In Fig. 18B). PAP with the 3' dideoxynucleotide blocked P* showed 
similar results (lanes 9-16 in Fig. 18B), Direct sequencing analysis confirmed die correct 
sequence of the amplified product. The effect of P* length was also tasted. Similar results were 
obtained with 35-mer P*s that are co-terminal with the 30-mer P*s and five nucleotides longer at 
their $* termini (Fig. 18C). Other P*s specific for the wild-type sequence at the 3* terminus 
(with acycloTMP and ddTMP) were also tested with similar results. 

[0295] Family U DNA polymerases Vent (exo-) and Pfu (exo-) were tested using the above 
model system. With the acyclonucleotide blocker and perfect match at the 3" tenniiius, the 
mutated template was amplified efficiently by PAP with one P+ (lanes 1 and 2 in Pigs. 13D and 
1SB) and one opposing unblocked oligonucleotide and by PAP with two opposing P+s of P*l 
and P*2 (a special form of PAP where the two opposing P*s are overlapped at their 3" termini by 
one nucleotide) (lane 3 in Figs. 18D and 18E). Howevw, no product was generated from the 
wild-type template because P*l and P*2 mismatch the wild-type template at their 3* termini, 
showing the specificity (lanes 5-7 in Figs. 18D and IRE), Vent (exo-) and Pfu (exo-) 
polymerases could not amplify with the 3" dideoxymicleotide blocked P* (lanes 9-16 in Figs. 
1SD and I8E). Direct sequencing analysis confirmed the correct sequence of the P+l/Ol and 
F*2/0 2 products. Similar results were obtained with AcycloPol (Perlein-Elmer), a genetically 
engineered Family II archeon DNA polymerase. It is not clear why PAP with Vent (exo-) and 
Pfu (exo-) DNA polymerases discrmiinates against 3' dideoxynbomieleotide blockers. 
Other Blockers 

(0296] These results demonstrate that two terminators used in Sanger gequencing can be uaed as 
blockers in PAP. Terminators have also been described as therapies of viral illnesses, such as 
AIDS, and for cancer therapy, such as, S'-deoxyadenosine (cOrdycepin)> 3'-azido-3'- 
deoxythymidine (AZT). 2',3'-dideoxyinosirie (ddl), 2 , i 3 , -dideoxy-3 , -ttuacyUdine (3TC) and 2' ) 3 < - 
didebydt^-2\3'-dideoxvthymidme (d4T). DNA polymerase can incorporate their triphosphate 
form into the synthesizing strand, and the incorporation cause termination of the extension 
(Gardner and Jack, 1999, Cheng et al. f 1987; St. Clair et al., 19S7; Uetio and Witsuya, 1997). 
The monophosphate nucleotides of 3-^do^deoxytayniidine (AZT)* 2',3'-dideoxy-3 h - 
thiacytidine (3TC) and 2^3'-didebydra-2\3^deoxythymidine (d4T), when located at the 2< 
termini of oligonucleotides, can be removed "by pyrophosphoroiysis by HIV reverse transcriptase 
or its variants (Ariorl et al„ 1998; Gotte et aL, 2000; Meyer et al., 2000; Urban el al., 2001). 
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These results indicate the application of PAP for various types of blockers and for ENA 
templates. 

(0297) In summary, PAP amplification occurred efficiently and specifically with 3* 
acyclonucleotide and 3' dideoxynucleotide blockers using TagFS DNA polymerase, and only 
5 with acyclonucleotide blockers using Vent (exo-) and Pjv (exo-) DNA polymerases. Other 3' 
terminal nonextendible oligonucleotides and other DNA polymerases can be used, if the 3' 
terminal nucleotide can be removed by pyrcphosphorolysia, and the activated oligonucleotide 
can be extended. 

io EXAMPLE 12 

Detection of Extremely Rare Alleles by Bi-PAP 
X Phage DNA Template 

[02981 Tbe wild-type X phage DNA template that contains an inserted wild-type lad gene of E. 
coli (Kchler et aL, 1991) was purchased from Stratagene. Three mutated X phage DNA 

15 templates were prepared from X phage plaques transfomied into SCS-B E. colt cells according 
to Maniatis et al ( 1982) . They contain an A to T mutation at nucleotide position 190, a T to G 
mutation at nucleotide 369 and a T to C mutation at nucleotide 369 in the bet gene, 
respectively. The amount of?, phage DNA was determined by UV absorbancc at 260 nm. 
Synthesis of P* by Adding a 3' Dideoxynucleotide 

20 [0299] The 3' terminal dideoxynucleotide was added to an oligodeoxymicleotide by terminal 
transferase, The mixture contained a total volume of 25 pi: 1 00 mM potassium cacodylate (pl-l 
7.2), 2.0 mM CoCI 2 , 0.2 mM DTT, 2 nM of the oligonucleoude, 2.4 mM 2', 3'-ddNTP (the 
molar ratio of the 3>-OH terminus to ddNTP was l:30)(Roche), tOO U of terminal transferase 
(Invitrogen). The reaction was incubated at 37 °C for 6 hr and then stopped by adding EDTA to 

35 a S mM final concentration. After desalting using a Centri-apm" 30 column (Princeton 
Separations), P* was purified by preparative 7 M urea/16% polyacrylanude gel electrophoresis 
with 30 mM Triemsuiolammen:ricbie buffer (pH 7.9 at 25°C) (Maniatis et al . , 1 982 , Liu 
et al, 1 999b). The amount of recovered P* wa3 determined by UV absorbance at 260 nm. 
|03OOJ Since small amounts of untenninated oligonucleotide would result in unexpected PCR 

30 amplification, P* was 3i P-labeled at the 5' terminus by T4 polynucleotide kinase and then was 
electrophoresed through a 7 M urea/20% polyacrylamide gel. Only P* products were visible 
even when the gel was overexposed, It is estimated that more than 99.99% of P* contained a 
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dideoxynucleotide at the 3 f terminus. The purity of P* was supported by the absence of PCR 
product at pH S.3 in which pyrophosphorolyeis is inhibited. 
PAP Amplification 

[0301] Bi-PAP assays for nucleotide 190 and nucleotide 369 of the lad gene were examined. 
The P*s ware 40 nucleotides long except that the upstream P*s for position 369 are 42 
nucleotides. Each P* contained the sequence-specific nucleotide at the 3' terminus. The PAP 
reaction mixture contained a total volume of 25 ul: 50 mM KC1, 20 mMHEPES/NaOH (pH 6.9 
at 25° C), 10 mM (NHOjSOj, 1.5 mM MgCl 2 , 40 pM each of the four dNTPs (dATP, dTTP, 
dGTP and dCTP), 0.1 uM each P* 150 pM Na^P,-, 4% 0MSO, 1 pCi of [«-"P]-dCTP 
(3000Ci/mmole, Amersham), 1 U of Amplifo^FS DNA polymerase (PE Applied Biosystems), 
2,000 copies of the X phage DNA template, or slated elsewhere. Tbe cycling conditions were 
92° C for 6 sk, 68° C for 20 sec, and 72" C for 20 sec for a total of 35 cycles. A denaturing step 
of 92 5 C for I min was added before the first cycle. 

£0302} The product was electrophoieaed through a standard 2.5% agarose gel, and the gel was 
stained with cthidium bromide for UV photography by a CCD camera (Bio-Rad Gel Doc 1 000). 
[0303] In order to differentiate the rrnitaled product from the wild-type product of the same size, 
non-denaturing SSCP gel electrophoresis was performed (Orita et al., 1959), The reaction was 
mixed with two-fold volume of loading buffer (7M urea and 50% fonnatnide), boiled and 
rapidly cooled on ice. The product in 10 ul of the mixed reaction was electrophoresed through 
an 8% non-denaturing PAGE-PLUS (Amresco) gel with 30 mM Bthanolairiine/Capsco buffer 
(pH 9.6) (Liu et al., 1999b) at 4°C. The gel was dried and exposed lo Kodak X-OMAT™ AR 
film for autoradiography. Tluee or four bands from each amplified product were seen on a gel. 
The upper one or two bands were double strained DNA due to hybridization of denatured 
single-stranded segments during the electrophoresis as a result of the substantial amounts of 
amplified product present, Increasing the concentration of the amplified product further increase 
the intensity of the upper bands. 

Highly Efficient PAP Amplification 
10304) TaqPS, a genetically engineered DNA polymerase greatly improved the efficiency of 
PAP. The conditions of PAP were further optimised for dramatically higher efficiencies 
allowing PAP to amplify directly from a few copies of X phage DNA or human genomic DNA 
template. The reaction components and the thermocycUug regime were optimized, including: j) 
decreased concentrations of PPi in that keeping the PPi to dNTP ratio essentially constant, ii) 
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use of low pH HEPES buffer (pH 6.9 at 2S°C) } iii) addition of (NH 4 )iS0 3l iv) increased amount 
of TVi^FS, and v) higher annealing temperature. 
Bi-PAP 

[0305] PAP has a potential selectivity of 3.3x1 0 M :1 (Fig. 19). Approaching this potential 
s requires a design that eliminates confounding sources of error. The A190T mutation of the lad 
gene of X DNA is used as a model system. In PAP with one downstream P* and one upstream 
unblocked oligonucleotide, extension errors from the non-blocked upstream oligonucleotide can 
produce the rare mutation of interest, thus reducing the selectivity. If the mismcorporation rate 
of TaqPS is 10"* per incorporated nucleotide and one of the three possible rnisincorporations 

1 o gen erates the A-+T mutation on the newly synthesized upstream Strand, the seleoti vi ty decreases 

to 3.3X10" 3 due to the side effect. In order to remove this limitation, Bi-PAP was developed 
{Fig. 20A). In Bi-PAP, both the downstream and upstream oligonucleotides are P*s that are 
specific for the nucleotide of interest at their 3' termini. The P*a overlap at their 3' termini by 
one nucleotide. 

is [0306] Bi-PAP amplified efficiently and specifically at nucleotide position 190 using X phage 
DNA containing the lad gene as template (Fig. 20B). Addition of human genomic DNA did not 
affect the amplification. The 79-bp product of Bi-PAP was easily distinguished from 
unincorporated P*s. P* did not form dimers because P* needs a perfectly matched region at the 
3' terminus for activation. Similar results were observed at nucleotide position 369. Direct 

2 a sequencing analysis confirmed the correct sequence of the amplified product. 

Sensitivity and Selectivity of Bi-PAP 
[0307] In Order to demonstrate the extremely high selectivity of Bi-PAP, more than 10 10 copies 
of DNA template was used for a Bi-PAP reaction. X DNA containing the fad gene of E.coli 
was chosen as the model system because lpg of X DNA contains 2xl0 10 vector genomes, while 

2 5 lpg of human genomic DNA only contains 3.3xl0 5 genomes. In order to avoid potential 
contamination of the wild-type X DNA in thie laboratory, mutation-specific Bi-PAP assays with 
mutated P*s were chosen to amplify the wild-type X DNA. The relative frequency of a 
spontaneous mutation of the lad gone in the wild-type X DNA is estimated to be less than 10" ? 
by examining X phage plaques infecting coli. 

30 £0308] The sensitivity and selectivity of Bi-PAP were examined using three mutation-specific 
Bi-PAP assays with their corresponding mutated X DNA (see Table 7 footnotes for definitions). 
Four titration experiments were performed for each mutation-specific Bi-PAP assay (Figs. 2lA- 
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21 C). Experiment I tested how much the mutated P* can "tolerate" the wild-type DNA template 
(ie., the maximum copies of the wild-type template without a detectable mutated product). The 
wild-type X DNA was titrated from 2xl0 10 copies to 2x1 0 1 * copies. The maximum tolerances 
were 2x10 s to 2xl0 10 f 2xl0 7 to 2x lo\ and 2x1 0 7 to 2x 10*, respectively, for the three mutation 
specific Bi-PAP assays respectively (Figs. 21A-21C). Experiment II tested the sensitivity of Bi- 
PAP. The mutated X DNA was titrated from 2x10 s to 0 copies. The ratio of the maximum 
tolerance (Experiment I) to the sensitivity is the selectivity. Experiment H was repeated in the 
presence of large amount of wild-type template (Experiment IIT) or large amounts of human 
genomic DNA (Experiment IV) without effect* (Fig. 21A; data not shown for T369G and 
T369C). A dose response with template copy number was observed. 



TABLE 7 

Summary of ibe three imitatioiNspecific Bi-PAP assays* 



Assay 


Position 1, 


Type' 


Sensitivity 0 


Selectivitity 0 


A 


190 


A:T^T:A 


2 


10":l to 10 |LJ ;I """ 


B 


369 


T:A-y&C 


2 


10':1 to 10°:1 


C 


369 


T:A-»-C:G 


2 


10':1 to 10 rf :l 



L lit each of the three mutation-specific Bi-PAP assays > two opposite P+s with one nucleotide overlap at 
their 3' termini were used. The P*s are 40-42 nucleotides long. They are 
5 ' GAT GGCGGAG CTG AATTAC ATTC CC AACCGC GTGGCAC AT - (SEQ ID NO:61) and 
5' GGCAACGCCAATCAGCAACaACTGTITGCCCGCCAOTraA* (SEQ ID NO;62) in Assay A; 
S'GAAGCGOaTrCOAAGCCTGTAAAGmGCOGTGCACAATCG* {SEQ TD NO:63) and 
5' GCGGA7AGTTAATGATCAGCCCAC TGACGCGTTGCGCGAGAC* (SEQ ID NO:64) in Assay 
B' 5' G A AGC GGC GTCGAAG CCTG1 AAAG CGGCGGTOC ACAAT C C* (SEQ ID NO:65) and 
5' GCGGATAGTTAATGATCAGCCCACTGACGCGTTGCGCGAG AG* (SEQ ID NO:66) in Assay 

Ch 

b The position of the first nucleotide of transcript in the lad gene is assigned the nucleotide position 1 
(Parabaugh, 1978), The 3' nucleotide of the P* is located at the indicated position and is complementary 
to the corresponding mutation. 

c The sensitivity is defined as the minimum copies of the mutated template ftom which a detectable 
mutate product is generated when a Tnutation.specific Bi-PAP assay is used. It was determined by 
Experiment 0 (Figs. 21A-21Q, 

d The selectivity i$ the ratio of the maximum copies of the wild-type template with undetectable product 
the minimum copies of the mutated template with detectable product to, when a mutation-specific Bi- 
PAP assay is used> 

[0309] The 3p P rqximately lOO.fold difference in selectivity between the nucleotide positions 
190 and 369 may derive from: i) the presence of spontaneous mutations at the position 360 at a 
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frequency of 10" 7 to 10~* m the wild-type X DNA, ii) impurity of P* oligonucleotides, iii) 
specificity of pyrophosphorolysis for a perfect match et the 3' temiinus and fidelity of DNA 
polymerase to incorporate a correct nucleotide may be associated with sequence context such 
that the Type II non-specific amplification occurs at a frequency of 10' 7 to 10^, In the latter 
5 case, a 100-fold difference in selectivity could arise from a 10-fold difference in 
pyrophosphorolysis specificity and a 10~fold difference in DNA polymerase fidelity with 
sequence context. 

[03101 The rate of a spontaneous mutation of X phase in E, coli varies from locus to locus, on 
the nverage from Id' 9 to 10~ M per incorporated nucleotide. The amplified signal seen in 

10 Experiment I might be caused by rare spontaneous mutations. 

[0311] There is a possible side reaction due to the impurity of P* contamination of unblocked 
oligonucleotide where the dideoxy terminus has not been added, although no unblocked 
oligonucleotide was detected in the P*. However, this selectivity may not be ihnited severely by 
small amounts of unblocked oligonucleotide because the product generated would be much 

is more likely to be the wild-type rather than the specific mutation (3.3XJ0 5 : 1). 

[0312| In summary, Bi-PAP has extremely high sensitivity and selectivity. Bi-PAP can 
selectively detect two copies of rare mutated allele with a sulfite base substitution from up to 
2x10* copies of the wild-type allele. Bi-PAP is a simple, rapid, automatable method for 
detecting any rare allele of interest. 

20 

EXAMPLE 13 

Measurement of Mutation Loud in Mouse Tissues by Bi-PAP 
Materials and Methods 

[0313] Liver, heart, adipose tissue, cerebrum and cerebellum from iO~day to 25-month old mice 
a 5 were snap frozen and stored under liquid nitrogen until used. DNA was extracted according to 
the Big Blue protocol (Stratagene instruction manual). In brief, tissues were homogenized and 
digested with proteinase fC. The genomic DNA was extracted with phenol/chloroform and 
precipitated with ethanol. The DNA was dissolved in TE buffer (10 mM Tris/HCl, 1 mM 
EDTA, pH 8.0) and stored at 4 °C. The amount of the mouse genomic DNA was determined by 
3 o UV absorbance at 260 nm. 

[03141 The mutation-specific Bi-PAP assay for T3G9G {Assay B: the two opposite P*s are 
dideoxynucleotide blocked with one nucleotide overlap at their V termini are: 
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5'GAAGCGGCGTCGAAGCCTGTAAAGCGGCGGTGCACAATCG*3' (SEQ ID 
NO:63) and 

5' GCGGATAGTTAATGATCAGCCCACTGACGCGTTGCGCGAGAC*3' (SEQ m 

NO:64) 

of the fac/ gene was performed as above except that i) the reaction contained 2 pg of (he mouse 
genomic DNA (-20 kb in size), unless otherwise stated; ii) mouse DNA in 20 ui (1.25x HEPES 
buffer, 5% BMSO without MgClj) was heated at 100 o C for 2 min and quickly cooled on ice, 
before the other components added; iii) a denaturing step at 95°C for 1 min was added before the 
first cycle; iv) the denaturing step was 95 P C for 10 sec. 

(D3J 5] LO m I of the 25 ul reaction was mixed with 10 u! of the denaturing loading buffer, boiled 
and rapidly cooled on ice. The product was electrophoTesed through a &% 7M urea /PAGE gel 
with 90 mM TBE buffer at room temperature The gel waa dried and exposed to Kodak X~ 
DM AT™ AR film for autoradiography, 
Results and Discussion 

T0316] Transgenic mouse mutation detection systems permit determination of the frequency and 
pattern of spontaneous or induced mutations in vivo. The Big Blue* system uses, transgenic mice 
harboring chromosomally-integTated X phage DNA containing the £. coli lad gene as the 
mutational target (Grossen and Vijg, 1993; Goasen et ah, 1989; Kohieret al., 1990. The lad 
gene is integrated within each mouse diploid genome in 40 tandemly repeated X DNAs. 
[0317] The Big Blue* mutation detection system assay is performed by isolating genomic DNA 
from transgenic mouse tissues and mixing it with X packaging extracts. The packaged X phage 
can infect E. colt. In the presence of X-gal substrates (acl mutants give riae to blue plaques on a 
background of colorless wild-type plaques. Observed mutants derive overwhelrrtingly from the 
mouse (Hill et al., 1999). The mutant frequency is determined by dividing the number of circular 
blue plaques by the total number of plaques. Of 5000 sequenced mutant plaques, 31 T369G 
mutants have been found in a total of 149x10 s plaques screened from various ages, genders and 
treatments in this laboratory (frequency =■ 2.1x.I0" 7 ). 

[03181 To assess the utility of Bi-PAP for measuring ultra-rare mutations in mammalian cells, 
the T369G mutation was analyzed in genomic DNA from the Big Blue mice. Two pg of mouse 
genomic DNA was amplified in 25 pi reaction containing a total of 1.2xl0 7 copies of the lad 
gene. The mutation-specific Bi-PAP assay far T369G (Assay B) was performed for 18 samples 
in duplicate (Fig. 26A). Three categories of results were defined, each with similar number of 
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samples: 1) six samples were positive two times (5, 11-15), 2) seven samples were positive one 
time (1, 3, 6, 9, 1 6-18), and 3) five aamples were negative two times (2, 4, 7, 8, 10). 
(0319J Two samples in each category were studied further (Figs. 26B-26C, Table 8), In category 
1, for the two samples 5 and 12 with the strongest amplified signals (Fig. 26A), a four-fold 
S dilution to 0.5 ug and 16- fold dilution to 0.1 25 ug of mouse genomic DNA were performed for 
further quantitation (Fig. 26B). The T369G mutant frequency for each sample was estimated 
and varies 370-fold among the six samples (Table 8). The average T369G mutant frequency of 
2.9x1 0" 7 was within 50% of the average T369G mutant frequency of 2.1 xlO' 7 measured from 
4xlO J plaques using the Big Blue* mutation detection system and confirmed by direct 
10 sequencing. 

TABLE 8 



Somatic mutant frequency measured by Bi-PAP 





Sample* 


Mouse genomic DNA 


Frequency of positive amplification 1 "" 


Estimated 
mutant 

ft*equtney u 






Tissue 


Age 


2ug of 
DNA 


0.5 p,fi Of 
DNA 


0.125 ug of 
DNA 




1 


12 


Adipose 


6 months 


8/8 


8/8 


4/8 (0.<59) c 


9.25x10" 


2 


5 


Liver 


25 months 


8/8 


7/8 


5/8 (0.98) 


1.31x10^ 


3 


3 


Liver 


25 months 


8/24(0.41) 






3.38x10- 


4 


9 


Liver 


25 months 


13/24 (0,78) 






6.50x1 0\ 


5 


7 


Liver 


25 months 


2/24(0.09) 






7.25xl0" v " 


6 


10 


Liver 


25 months 


1/24 (0.04) 






3.52x10-" 


Average 












2.91xl0 7 



IS b the ratio of the number of positive signals for the T369G mutation relative to &x lata] number of 
reactions. 

c the average number of T369G mutants per reaction is estimated using a fbtmul* (the frequency of zero 
mutants per reaction - e"' , x is the average number of mutants per reaction) suppose that the mutant 
distributes in tiie reaction according to a Poisson distribution and that if one or more mutants are in the 

2 o reaction, the amplification is positive, and if zero mutant is in the reaction, it is negative. 

the frequency of the T369Q mutant of the lad gene in mouse genome per reaction i& estimated 
assuming that the mutant distributes in mouse genomic DNA according to a Poisson distribution and that 
one or more mutants are positive in the detection. For each of aamples 12 and 5, a total of -6.0xl0* r 
copies of the lad gene are used for the estimate, and for each of samples 3, 9, 7 and 10. -2,3x1 0 1 copies 

2S are used assuming that 2 iigqf the lac? mouse genomic DNA oontains-J.2xlO T copiesof the la^/gene. 



WO n3/0936G4 PCT/USIW145K 
74 

[0320] The 370-fold variation in mutant frequency was observed in livers of five mice at 25 
months of age. This large variation could be dye to difficulties in amplifying one copy of the 
template. To address this issue, each of die analyses was repeated at least two times with similar 
resulta. For example, in sample 9, seven of 14 reactions with 2 ug of DNA were positive in one 
experiment^ three of four such reactions were positive in another experiment, and two of four 
auch reactions were positive in a third experiment. For sample 7, there was One positive in eight 
and one positive in 14 reactions. The product was sequenced to confirm the T369G mutation 
after re-amplification from the positive reaction. In addition, positive controls (2 pg of the lad + 
mouse DNA with -10 copies of T369G) aud negative controls (mouse genomic DNA without 
the lad target, i.e„ the lad " mouse DNA) were performed. As additional positive controls, 
reconstruction experiments were performed in that the copy number of the mutated A DNA per 
reaction was serially diluted by two-fold in the presence of the lacT genomic DNA carrier. 
Reproducible amplifications from as low as one copy of template were demonstrated (Figs. 26B, 
26C), 

10321] In retrospect, the 370-fbld variation in the frequency of T369G mutant observed among 
the six mice may not be surprising because the T369G mutant frequency among mice is over 
dispersed, implying a hyper-Poisson distribution (Nishino et al., 1996; Piegorsch et al„ 1994). 
Among six mice the inter-animal variation in the overall mutant frequency assayed by the Big 
Blue* mutation detection system might be 3 to 4 fold, with significant founder effects in one or 
a few of the mice. The variation might be in the range of SxlO" 3 to SxlO" 3 which is the sum of 
more than 1,000 different mutations. Here r only the T369G mutation is assayed. It is anticipated 
that the great majority of the signal derives from duplex mutated templates (Hill et aL + 1999), 
but it Should be noted that unresolved mismatch intermediates derived primarily from DNA 
replication or DNA repair would also gtneratc a signal. Thus, the physical limit of sensitivity is 
actually one half of a duplex DNA molecule per teaction. 

(0322] In conclusion, we demonstrate that Bi-PAP can analyze ultra-rare mutations at 
frequencies as low aa 10" 1 to 10" 9 , depending on the assay. It is shown that Bi-PAP can delect 
smglo copies of the somatic mutation directly from mammalian genomic DNA. The inter-assay 
variation may reflect locus- specific variability in the assay sensitivity ot in the frequency of the 
assayed mutants among the samples. More work is necessary to' distinguish between these 
possibilities. In mammalian DNA the numbsr of copice of tqinplate is limited by the enormous 
genome size. Two v»g of genomic DNA contains only 600,000 mouse haploid genomes, yet the 
reaction is viscous. Our analysis of the Big Blue mouse genomic DNA was facilitated by the 20 
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copies of the lad gene per haploid genome. To measure mutation load in humans, genomic 
DNA in one reaction could be increased at least three fold by reducing the viscosity {eg., 
shearing the DNA into small segments by ultrasonic treatment) and another four fold by 
expanding the reaction volume to 100 pi. Mutation load hi human genomic DNA might be 
5 facilitated by analyzing segments of virtually identical sequence. e.g „ there are three 9.6 kb 
segments with 99 + % sequence identity on human X chromosome involved in a common 
inversion mutation in hemophilia A (Lalrich et &L t 1993). Less complex genomes including C- 
elegarss, Drosophila, and human mitochondria genome or chronic viral infections (e.g., hepatitis 
B) aiso should be analyzable with this protocol. 

to 

[0323] While the invention has beeu disclosed in this patent application by reference to the 
details of preferred embodiments of the invention, it is to be understood that the disclosure is 
intended in an illustrative rather than in a limiting sense, as it is contemplated that modifications 
will readily occur to those skilled in the art, within the spirit of the invention and the Scope of 
IS the appended claims. 
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WHAT IS CLAIMED IS: 

1 . A method of detecting a nucleic acid which comprisies: 

(a) annealing to a nucleic actd a first oligonucleotide P*, wherein the first 
oligonucleotide P* has a non-extendable 3' end, wherein the 3 'non-extendable terminus 
of first oligonucleotide P* is removable by pyrophosphoroiysis; 

(b) removing the 3' non-extendable terminus of the first oligonucleotide P* by 
pyrophosphorolysis to produce an unblocked first oligonucleotide; 

(c) extending the unblocked first oligonucleotide; 

(d) detecting the prc&ence of the nucleic acid by detecting the extended first 
oligonucleotide. 

2. The method of claim 1, wherein pyrophosphorolyBis ie performed with a nucleic acid 



3. The method of claim 1 , wherein extension is performed with a nucleic acid polymerase. 

4. The method of claim I, wherein a nucleotide or nucleotide analog present in the 
extension step contains a label and the presence of a Jabel in the extended first 
oligonucleotide is detected. 

5. The method of claim 1 , wherein steps (a), (b) and (c) are repeated. 

6. The method of claim 1, wherein the 3' non-extendable terniinus of first Oligonucleotide 
P* is a nucleotide or nucleotide analog which cannot be extended by a nucleic acid 
polymerase but which can be removed by pyrophosphorolysis. 



The method of claim 6, wherein the nucleotide or nucleotide analog is selected from the 
group consisting of a 3 4 deoxynunleotide, a 2*,3'-dideoxynucleotide, an acyclonucleotrde, 
y-deoxyadencsbie (cordycepin), S'-azido-S^deoxythymidinc (AZT), 2',3 , -dideoxyinosine 
(ddl), 2 F ,3 , -dideoxy-3 , -thiacytidtne (3TC) and 2 , ,3'-didehydro-2' t 3'-didcoxytbymidine 
(d4T). 



The method of claim 1, wherein a second oligonucleotide 
complementary strand of the nucleic acid. 



9. The method of claim B, 
oligonucleotide. 



wherein the second oligonucleotide is an unblocked second 



10, The method of claim 9, wherein the second oligonucleotide is extended in step (c). 

11. The method of claim 9, wherein the second oligonucleotide is a second oligonucleotide 
P* which has a non-extendable 3' end, wherein the 3' non-extendable terminus of the 
second ofigonuc?eotide P" is removable by pyrophosphoroiysis. 



12. The method of claim 11, wherein the 3' non-extendable terminus of the i 
oligonulceotide is removed by pyrophosphoroiysis in step (b) to produce an 
unblocked second oligonucleotide and wherein the unblocked second oligonucleotide is 
extended in step (c). 

13. The method of claim 11, wherein the 3'non-extendable terminus of the second 
oligonucleotide T* is a nucleotide or nucleotide analog winch cannot be extended by a 
nucleic acid polymerase but which can be removed by pyrophosphoroiysis. 

1 4. The method of claim 13, wherein the nucleotide or nucleotide analog is selected from me 
group consisting of a 3'deoxynucleoridc, a^'^'-dideoxynucleotide, an acyclonucleotide, 
3' -deoxy adenosine (cordycepin), 3'-azido-3'-deoxythyTnidine (AZT), 2',3'-dideoxyiuosine 
(ddl), 2',3'-dideoxy-3 , -thiacytidine (3TC) and ^'-didehydro^S'-dideoxythymidme 

<d4T> 

15- The method of claim 1 1, wherein the 3 'non-extendable terminus of first oligonucleotide 
P* is a nucleotide or nucleotide analog which cannot he extended by a nucleic acid 
polymerase but which can be removed by pyrophosphoroiysis and wherein the 3'non- 
extendable terminus of the second oligonucleotide P* is a nucleotide or nucleotide 
analog which cannot be extended by a nucleic acid polymerase hut which can be 
removed by pyrophosphoroiysis. 



16. The method of claim 1 5, wherein the nucleotide or nucleotide analog is selected from the 
group consisting of aS'tleoxynucleotide, a 2 t ,3 , -dideoxynneleotide, an acyclonucleotide, 
3 T -deoxyadcnoaine (cordycepin), 3 t -arido-3'-deoxytoynTidine (AZT), ^'-dideoxyinosine 

5 (ddl), 2\3^dideoxy-3^thiacytidine (3TC) and Z.S'-didehydro-S'.a'-dLdeoxythymidinc 

<d4T). 

17. A method of synthesizing a nucleic acid which comprises: 

(a) annealing to a nucleic ncid a first oligonucleotide P* which has a non- 
10 extendable 3 ! end, wherein the 3' non-extendable terminus of the first oligonucleotide P* 

is removable by pyrophosphoroiysis; 

(b) removing the 3' non-extendable terminus of the first oligonucleotide E»* 
annealed to the nucleic acid by pyrophoephorolysis; and 

(c) extending the tmbloclccd first oligonucleotide using a nucleic acid 
is polymerase. 

18. The method of claim 17, wherein steps (a) to (c) are repeated. 

19. The method of claim 17, wherein pyrophosphoroiysis and extension are performed with 
20 a nucleic acid polymerase. 

20. The method of claim 17, wherein a second oligonucleotide is annealed to a nucleic acid 
strand which is the complement of said first strand. 

as 21. The method of claim 20, wherein the second oligonucleotide is an unblocked second 
oligonucleotide. 

22. The method of claim 21 , wherein the second oligonucleotide is extended in step (c). 
20 22. The method of claim 22, wherein ateps (a) to (c) are repeated. 
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24. The method of claim 20, wherein the second oligonucleotide is a second oligonucleotide 
P* which has a non-extendable 3 K end, wherein the 3' non-extendable terminus of the 
Second oligonucleotide P* is removable by pyrophosphoroiysis. 

25. The method of claim 24, wherein steps (a) to (c) are repeated. 



S3 

The method of claim 30, wherein the nucleotide or nucleotide analog ia selected from the 
group consisting of a 3 *deoxynucleotide, a 2 , ,3 1 »dideoxynucleotidc» an acyclonucleotide, 
3 r -deoxyadenosine (cordycepin), 3''azido-3'-deoxythymidiiie (AZT), J'^-dideoxyinosine 
(ddl), 2\3'-dideoxy-3'-tmacytidine (3TC) and 2\3^didehydio*2^3'-dideoxythyrmdine 
(d4T). 



26. The method of claim 17, wherein the 3'non-extendable terminus of first oligonucleotide 
P* is a nucleotide or nucleotide analog which cannot be extended by a nucleic acid 
polymerase but which can be removed by pyrophosphoroiysis 

27. The method of claim 26, wherein the nucleotide or nucleotide anatog is selected from the 
group consisting of a S'deoxynucleotide, a 2',3'-dideoxynucIeotide, an acyclonucleotide, 
3'-dcoxyadenosine (cordycepin). S'-azidoO'-deoxythymidine (AZT), ^'-dideoxyinosine 
(ddl), 2^3 , -dideoxy-3 , -thiacytidine (3TC) and 2;3'-didehydro-2\3'-dideoxythvmidine 

(<i4T). 

28. The method of claim 24, wherein the 3'non-exteildable terminus of second 
oligonucleotide P* is a nucleotide or nucleotide analog which cannot be extended by a 
nucleic acid polymerase but which can be removed by pyrophosphoroiysis. 

29. The method of claim 28, wherein the nucleotide or nucleotide analog is selected from the 
group consisting of a 3'deoxynuclcotide, a 2 , J 3 , 'dideoxynucleotide > an acyclonucleotide, 
3'-deoxyadenosine (cordycepin), S'-azido-S'-deoxv&ymidine (AZT), 2 , ,3 , -dideoxyinostne 
(ddl), 2\3 T -dideoxy-3 T -thiacytidine (3TC) and 2 , ,3 , .didehydro-2\3 , -dideoXythymidine 
(d4T). 

30. The method of claim 24, wherein the 3'non-extendable terminus of first oligonucleotide 
P* is a nucleotide or nucleotide analog which cannot be extended by a nucleic acid 
polymerase but which can be removed by pyrophosphoroiysis and wherein the 3'non- 
extendable terminus of the second oligonucleotide P* is a nucleotide or nucleotide 
analog which cannot be extended by a nucleic acid polymerase but which can be 
removed by pyrophosphoroiysis. 



32, A method of detecting a nucleic acid which comprises: 

(a) annealing to a nucleic acid two oligonucleotide P*s, wherein each 
oligonucleotide P* has a non-extendable 3' end, wherein the 3* non-cxtcndablc terminus 

io of each oligonucleotide P* ia removable by pyrophosphoroiysis, wherein one 

oligonucleotide P+ overlaps with the other oligonucleotide P* by at least one nucleotide 
at their respective 3' ends, and wherein one oligonucleotide P* anneals to a first nucleic 
acid strand arid the other oligonucleotide P* anneals to a nucleic acid strand which is the 
complement of the first nucleic acid strand; 

15 (b) removing the 3* non-extendable terminus of the annealed first and second 

oligonucleotide P+b by pyrophosphoroiysis to produce unblocked oligonucletides; 

(c) extending the unblocked oligonucleotides; and 

(d) detecting the extended oligonculeotides, 

io 33. The method of claim 32, wherein pyrophosphoroiysis is performed with a nucleic acid 
polymerase, 

34. The method of claim 32, wherein extension is performed with a nucleic acid polymerase. 

2S 35. The method of claim 32, wherein pyrophosphoroiysis and extension are performed with 
a nucleic acid polymerase. 

36. The method of claim 32, wherein a nucleotide or nucleotide analog present in the 
extension step contains a label and the presence of a label in the extended 

3 0 oligonucleotide is detected. 

37. The method of claim 32, wherein steps (a), (b) and <c) are repeated. 
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38. The method of claim 32, wherein the 3'non-cxtcndable terminus of each oligonucleotide 
P* is a nucleotide or nucleotide analog which cannot be extended by a nucleic acid 
polymerase but which can be removed by pyrophosphorolysis. . 

39. The method of claim 38, wherein the nucleotide or nucleotide analog is selected from the 
group consisting of a 3 'deoxyrmcleotide, a 2%3'-d£deoxvnucleotide, an acyelonucleotide, 
3 '-deoxy adenosine (cordyecpin), 3'-aadc~3'-deoxythynridine (A2T), Z'^-dideoxyinosine 
(ddl), S^dideoxy^-thiacytidine (3TC) and 2 , ,3'-didehydro-2',3 , -dideoxythvmidine 
(d4T). 

40. A method of nucleic acid detection which comprises: 

(a) synthesizing a template nucleic acid from the nucleic acid; 

(b) annealing to the template nucleic acid a first oligonucleotide P* which has a 
noil-extendable 3' end, wherein the 3* non-extendable terminus of the first 
oligonucleotide P* is removable by pyrophosphorolysis;. 

(c) removing the 3* non-extendable terminus of the first oligonucleotide P* 
annealed to the template nucleic acid by pyrophosphoro lysis; and 

(d) extending the unblocked oligonucleotide 

41. The method of claim 40, wherein Ihe template nucleic acid is synthesized by 
(i) annealing to one strand of the nucleic acid a first complementary oligonucleotide that 
is sequence specific and extending said first complementary oligonucleotide on the 
nucleic acid strand with a nucleic acid polymerase to produce an extended first 
oligonucleotide and (ii) adding a second oligonucleotide to a 3' terminus of the extended 
first oligonucleotide to produce a template nucleic acid. 

42. The method of claim 41, wherein the second oligonucleotide is added to the extended 
first complementary oligonucleotide by ligation. 



44. The method of claim 41 which further comprises the step (a I) amplifying the template 
nucleic acid. 

45. The method of claim 44, wherein the amplification is a polymerase chain reaction. 

s 

46. The method of claim 44, wherein the amplification ts by a pyrophosphorolysis activated 
polymerization. 

47. The method of claim 40, wherein the template nucleic acid is synthesized by a reaction 
io which synthesizes copies of nucleic acid. 

48. The method of claim 47, wherein the amplification reaction is a polymerization chain 



is 49, The method of claim 47, wherein the amplification reaction is a pyrophosphorolysis 
activated polymerization reaction. 

50. The method of claim 40, wherein pyrophosphorolysis is performed with a nucleic acid 
polymerase. 

20 

51. The method of claim 40, wherein extension is performed with a nucleic acid polymerase. 



52. The method of claim 40, wherein pyrophosphorolysis and extension are performed with 
a nucleic acid polymerase. 

53. The method of claim 52, wherein a nucleotide or nucleotide analog present in the 
extension Step contains a label and the presence of (he label in the extended 
oligonucleotide la detected. 



The method of claim 41, wherein the second complementary oligonucleotide is added to 
the extended first complementary oligonucleotide by terminal ti 



3 0 54, The method of claim 40, wherein steps (a) to (d) are repeated. 



WO 1*3/095664 PCT/U5DJ/I455fr 
86 

55. The method of claim 40, wherein the 3' non-extendable terminus of oligonucleotide P* 
is a nucleotide or nucleotide analog which cannot be extended by a nucleic acid 
polymerase but which can be removed by pyrophosphorolysis. 

56. The method of claim 55 s wherein the nucleotide or nucleotide analog is selected from the 
group consisting of a 3 f deoxynucleotide, a 2' ) 3'-dideoxynucleotide, an acycionuclcotide, 
3'-deoxyadenosine (cordycepjn), S'-addo-S'-deoxythvmidinB (AZT), 2\3'-dideoxyirw3ine 
(ddl), 2S3'-dideoxy-3^tmacytidine (3TC) and 2^3 T -d^^>hyd^o.2^3^dideoxythy^nidine 
(d4T), 

57. The method of claim 40, wherein a second oligonucleotide is annealed to the template 
nucleic acid in step (b). 

58. The method of claim 57, wherein the second oligonucleotide is an unblocked second 
oligonucleotide which is extended in step (c->. 

59. The method of claim 57, wherein the second oligonucleotide is a second oligonucleotide 
P* which has a non-extendable 3" end, wherein the 3* non-extendable terminus of the 
second oligonucleotide P* is removed by pyrophosphorolysis in step (c) to produce an 
unblocked second oligonucleotide and wherein th= unblocked second oligonucleotide is 
extended in step (cl). 

60. The method of claim 59, wherein the 3'non-extendabie terminus of second 
oligonucleotide P+ is a nucleotide cr nucleotide analog which cannot be extended by a 
nucleic acid polymerase but which can be removed by pyrophosphorolysis.. 

61. The method of claim 60, wherein the nucleotide or nucleotide analog is selected from the 
group consisting of a 3'daaxynucleQtide, a 2' > 3 , -dideoxynucleolide, an acyelonucleotide, 
S'-deoxyadenosine (cordycepin), 3'-a7idcO'-decttythyniidine (AZT), 2 T ,3'-dideaxyinosme 
(ddl), 2\3'-dMeoxy4'-thiacvtidiM (3TC) and 2\3^didehydro-2 , ! 3 r -dideoxythyniidme 
(d4T). 
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62.. The method of claim 59> wherein the 3 'non-extendable terminus of second 
oligonucleotide P* is a nucleotide or nucleotide analog which cannot be extended by a 
nucleic acid polymerase but which can be removed by pyrophosphorolysia.. 

& 63. The method of claim 62, wherein the nucleotide or nucleotide analog is selected from the 
group consisting of a 3'deoxynucleotide, a 2',3'*dideoxynucleotide, an acyctonuclcotide, 
3 T -deoxyadenosine (cordyecpin), 3 1 • az ido -3 1 • d eoxythymidine (AZT), 2 , ,3 , -dideoxy£noslne 
(ddl), 2',3'-dideoxy-3 T -thiacytidine (3TC) and Z^'-didehydro-y^'-dideoxyihymidinc 
(d4T>, 

10 

64. The method of claim 59. wherein the 3 'non-extendable terminus of first oligonucleotide 
P* is a nucleotide ot nucleotide analog which cannot be extended by a nucleic acid 
polymerase but which can be removed by pyrophosphorolysis., 

15 65, The method of claim 64, wherein the nucleotide or nucleotide analog is selected from the 
group consisting of a 3*deoxynucIeQtide, a I'.S'-dideoxynucIeotidB, an acyclonucleotide, 
3'- deoxy adenosine (cordycepin), 3'-azido-3'-deox;vmyrnidine (AZT), 2\3'-dideoxyinosine 
(ddl), 2 , ,3'-dideoxy-3 , -thiacytidine (3TC) and 2 l ,3 , »didehydro-2 , ,3'*dideoxythyraidjne 
(d4T). 

20 

65. The method of claim 40, wherein the nucleic acid is a nucleic acid that has been cleaved. 

67. The method of claim 40, wherein the nucleic acid is a double stranded nucleic acid that 
has a lesion or a nick. 

2S 

68. A pyrophosphorolysis activated polymerization method of synthesizing a desired nucleic 
acid strand on a nucleic acid template strand which comprises serially 

(a) annealing to the template strand an activatable oligonucleotide P* that has a 
non- extendable 3'-deoxynucleotide at its 3' terminus and has a mismatch with respect to 

30 the corresponding nucleotide on the template strand, 

(b) pyrophosphorolyzing the resulting duplex with pyrophosphate and an enzyme 
that has pyrophosphorolysis activity and activates the oligonucleotide P* by removal of 
the tonninal 3'»deoxynuclecmde, and 
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(is) polymerizing by extending the activated oligonucleotide P* on the template 
strand in presence of four nucleoside triphosphates and a nucleic acid polymerase to 
synthesize the desired nucleic acid strand, 



The pyrophosphorolysis activated polymerization method of claim 6S that includes 
amplification of the desired nucleic acid strand by 

(d) separating the desired nucleic acid strand of step (a) from the template strand 

and 

(e) repeating steps (a)-(d) until ft desired level of amplification of the desired 
nucleic acid strand is achieved. 

The pyrophosphorolysis activated polymerization method of claim 69 carried out in the 
presence of a second oligonucleotide that in step (a) anneals to the separated desired 
nucleic acid strand product of step (d), and wherein step (c) includes polymerizing by 
extending the second oligonucleotide on the desired nucleic acid strand to synthesize a 
copy of the nucleic acid template strand, and step (d) includes separating the synthesized 
nucleic acid template strand from the desired nucleic acid strand, so that amplification is 



The pyrophosphorolysis activated polymerization method of claim 70 wherein a 
mismatch between the activatable oligonucleotide ?* and the template strand occurs 
with respect to the corresponding nucleotide on the template. 

The pyrophosphorolysis activated polymerization method claim of 69 wherein steps (a) 
to (c) are conducted sequentially as two or more temperature Stages on a thermocycler. 
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<210> 6? 
<211> 40 
<212> DMA 
<213> Artificial 




<qo0> 63 

yaegcggcqt cgaagcctgt aaagcggcgg tgeacaateg 


4Q 


<220> 

<223> Oligonucleotide 




<210> 64 

<211> 42 

<2I2> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 




<220> 

<221> inisc feature 

<222> (40). .(40) 

<223> dldeoxynucleotida 

<400> 67 

gaageggect cgsagcctgt aaagcggcgg tgeacaaret 


40 


<2Z0> 

<22l> misc feature 

<222?> ^4^}..(42} 

<223> dideoxynucleOtlde 




<211> 42 

<212> DWA 

<213> Artificial 





WO MWSMU 



PCT/USD3/ 14JK 



WO 0 J/U9SMJ 



<220> 

<Z23;> Oligonucleotide 
<2 20> 

<2 21> misc_fB&tur<l 

<222> (42).. (42) 

<22S> didsoxsmucleofcide. 

<400> eg 

gcggatagtt satgntcagc ccactgacgc gttgcgcgag ; 



<210> 69 

<211> 40 

<212> DMA 

<213> Artificial 



<22Q> 

<220> 
<221> 
<r222> 
<223> 



OligamjclaotidB 



misc_feature 
(40)7. (40) 
[<i dEBxynucleohide 



<400> e& 

gatgacggag cfcgaattaca ttcccaaccg cgtggcacaa 



<210> 


70 


< 2 2 1 > 


40 


<212> 


DNA 


<213> 


Artificial 


<2 20> 




<223> 


Oligonucleotide 


<220> 




<221> 


misc feature 


<222> 


(40) . . (40) 


<223> 


didcoxynucleotide 


<4O0> 


70 



ggcaacgcca ateagcaaCg actgtttgcc cqcctittqt 



t&catLceca accgcgfcgge 



a ct g gcgggcaa a c 



<21Q> 


12 


<2ll> 


4C 


<212> 


DNA 


<213> 


Artificial 


<Z20> 




<223> 


Oligonucleotide 


<220> 




<221> 


mise feature 


<222> 


(40) . . (40) 


<223> 


dideo xynu cl e ot ide 


<4Q0> 


72 



gggccagact ggaggtggca aegccaatea gcaacgactg 



<:210> 


71 


<2ll> 


40 


<212> 


DNA 


<213> 


Artificial 


<22€> 




<223> 


01 igonucleotide 


<220> 




<221> 


mi so feature 


<222> 




<223> 


dida okynu cl eo tide 



